COMPOSITIONS AND METHODS
RELATING TO LUNG SPECIFIC GENES AND PROTEINS 

This application claims the benefit of priority from U.S. Provisional Application 
Serial No. 60/252,500 filed November 22, 2000, which is herein incorporated by
reference in its entirety. 

FIELD OF THE INVENTION 
The present invention relates to newly identified nucleic acid molecules and
polypeptides present in normal and neoplastic lung cells, including fragments, variants
and derivatives of the nucleic acids and polypeptides. The present invention also relates
to antibodies to the polypeptides of the invention, as well as agonists and antagonists of
the polypeptides of the invention. The invention also relates to compositions comprising
the nucleic acids, polypeptides, antibodies, variants, derivatives, agonists and antagonists
of the invention and methods for the use of these compositions. These uses include
identifying, diagnosing, monitoring, staging, imaging and treating lung cancer and
non-cancerous disease states in lung; identifying and monitoring lung tissue; and identifying
and/or designing agonists and antagonists of polypeptides of the invention. The uses also
include gene therapy, production of transgenic animals and cells, and production of
engineered lung tissue for treatment and research.

BACKGROUND OF THE INVENTION 
Throughout the last hundred years, the incidence of lung cancer has steadily
increased, so much so that in many countries it is now the most common cancer. In fact,
lung cancer is the second most prevalent type of cancer for both men and women in the
United States and is the most common cause of cancer death in both sexes. Lung cancer
deaths have increased ten-fold in both men and women since 1930, primarily due to an
increase in cigarette smoking, but also due to increased exposure to arsenic, asbestos,
chromates, chloromethyl ethers, nickel, polycyclic aromatic hydrocarbons and other
agents. See Scott, Lung Cancer: A Guide to Diagnosis and Treatment, Addicus Books
(2000) and Alberg et al., in Kane et al. (eds.), Biology of Lung Cancer, pp. 11-52, Marcel
Dekker, Inc. (1998). Lung cancer may result from a primary tumor originating in the
lung or a secondary tumor which has spread from another organ such as the bowel or
breast. Although there are over a dozen types of lung cancer, over 90% fall into two
categories: small cell lung cancer (SCLC) and non-small cell lung cancer (NSCLC). See
Scott, supra. About 20-25% of all lung cancers are characterized as SCLC, while
70-80% are diagnosed as NSCLC. Id. A rare type of lung cancer is mesothelioma, which is
generally caused by exposure to asbestos, and which affects the pleura of the lung. Lung
cancer is usually diagnosed or screened for by chest x-ray, CAT scans, PET scans, or by
sputum cytology. A diagnosis of lung cancer is usually confirmed by biopsy of the
tissue. Id.

SCLC tumors are highly metastatic and grow quickly. By the time a patient has
been diagnosed with SCLC, the cancer has usually already spread to other parts of the
body, including lymph nodes, adrenals, liver, bone, brain and bone marrow. See Scott,
supra; Van Houtte et al. (eds.), Progress and Perspective in the Treatment of Lung
Cancer, Springer-Verlag (1999). Because the disease has usually spread to such an
extent that surgery is not an option, the current treatment of choice is chemotherapy plus
chest irradiation. See Van Houtte, supra. The stage of disease is a principal predictor of
long-term survival. Fewer than 5% of patients with extensive disease, i.e., disease that has
spread beyond one lung and surrounding lymph nodes, live longer than two years. Id.
However, the probability of five-year survival is three to four times higher if the disease
is diagnosed and treated when it is still in a limited stage, i.e., not having spread beyond
one lung. Id.

NSCLC is generally divided into three types: squamous cell carcinoma,
adenocarcinoma and large cell carcinoma. Both squamous cell cancer and
adenocarcinoma develop from the cells that line the airways; however, adenocarcinoma
develops from the goblet cells that produce mucus. Large cell lung cancer is so
named because the cells look large and rounded when viewed microscopically, and
generally are considered relatively undifferentiated. See Yesner, Atlas of Lung Cancer,
Lippincott-Raven (1998).

Secondary lung cancer is a cancer initiated elsewhere in the body that has spread
to the lungs. Cancers that metastasize to the lung include, but are not limited to, breast
cancer, melanoma, colon cancer and Hodgkin's lymphoma. Treatment for secondary
lung cancer may depend upon the source of the original cancer. In other words, a lung
cancer that originated from breast cancer may be more responsive to breast cancer
treatments and a lung cancer that originated from colon cancer may be more
responsive to colon cancer treatments.

The stage of a cancer indicates how far it has spread and is an important indicator
of the prognosis. In addition, staging is important because treatment is often decided
according to the stage of a cancer. SCLC is divided into two stages: limited disease, i.e.,
cancer that can only be seen in one lung and in nearby lymph nodes; and extensive
disease, i.e., cancer that has spread outside the lung to the chest or to other parts of the
body. For most patients with SCLC, the disease has already progressed to lymph nodes
or elsewhere in the body at the time of diagnosis. See Scott, supra. Even if spreading is
not apparent on the scans, it is likely that some cancer cells have broken away and
traveled through the bloodstream or lymph system. In general, chemotherapy with or
without radiotherapy is the preferred treatment. The scans and tests done at
diagnosis are used later to assess how well a patient is responding to treatment.

In contrast, non-small cell cancer may be divided into four stages. Stage I is
highly localized cancer with no cancer in the lymph nodes. Stage II cancer has spread to
the lymph nodes at the top of the affected lung. Stage III cancer has spread near to where
the cancer started. This can be to the chest wall, the covering of the lung (pleura), the
middle of the chest (mediastinum) or other lymph nodes. Stage IV cancer has spread to
another part of the body. Stage I-III cancer is usually treated with surgery, with or
without chemotherapy. Stage IV cancer is usually treated with chemotherapy and/or
palliative care.

A number of chromosomal and genetic abnormalities have been observed in lung
cancer. In NSCLC, chromosomal aberrations have been described on 3p, 9p, 11p, 15p
and 17p, and chromosomal deletions have been seen on chromosomes 7, 11, 13 and 19.
See Skarin (ed.), Multimodality Treatment of Lung Cancer, Marcel Dekker, Inc. (2000);
Gemmill et al., pp. 465-502, in Kane, supra; Bailey-Wilson et al., pp. 53-98, in Kane,
supra. Chromosomal abnormalities have been described on 1p, 3p, 5q, 6q, 8q, 13q and
17p in SCLC. Id. Loss of the short arm of chromosome 3 (3p) has also been seen in
greater than 90% of SCLC tumors and approximately 50% of NSCLC tumors. Id.

A number of oncogenes and tumor suppressor genes have been implicated in lung
cancer. See Mabry, pp. 391-412, in Kane, supra and Sclafani et al., pp. 295-316, in
Kane, supra. In both SCLC and NSCLC, the p53 tumor suppressor gene is mutated in
over 50% of lung cancers. See Yesner, supra. Another tumor suppressor gene, FHIT,
which is found on chromosome 3p, is mutated by tobacco smoke. Id.; Skarin, supra. In
addition, more than 95% of SCLCs and approximately 20-60% of NSCLCs have an
absent or abnormal retinoblastoma (Rb) protein, the product of another tumor suppressor
gene. The ras oncogene (particularly K-ras) is mutated in 20-30% of NSCLC specimens and
the c-erbBl oncogene is expressed in 18% of stage 2 NSCLC and 60% of stage 4 NSCLC
specimens. See Van Houtte, supra. Other tumor suppressor genes, including p16INK4A and
p15INK4B, that are found in a region of chromosome 9, specifically in the region of 9p21,
are deleted in many cancer cells. See Bailey-Wilson, supra; Sclafani et al., supra.
These tumor suppressor genes may also be implicated in lung cancer pathogenesis.

In addition, many lung cancer cells produce growth factors that may act in an
autocrine fashion on lung cancer cells. See Siegfried et al., pp. 317-336, in Kane, supra;
Moody, pp. 337-370, in Kane, supra and Heasley et al., pp. 371-390, in Kane, supra. In
SCLC, many tumor cells produce gastrin-releasing peptide (GRP), which is a
proliferative growth factor for these cells. See Skarin, supra. Many NSCLC tumors
express epidermal growth factor (EGF) receptors, allowing NSCLC cells to proliferate in
response to EGF. Insulin-like growth factor (IGF-I) is elevated in greater than 95% of
SCLC and greater than 80% of NSCLC tumors; it is thought to function as an autocrine
growth factor. Id. Finally, stem cell factor (SCF, also known as steel factor or kit ligand)
and c-Kit (a proto-oncoprotein tyrosine kinase receptor for SCF) are both expressed at
high levels in SCLC, and thus may form an autocrine loop that increases proliferation.
Id.

Although the majority of lung cancer cases are attributable to cigarette smoking,
most smokers do not develop lung cancer. Epidemiological evidence has suggested that
susceptibility to lung cancer may be inherited in a Mendelian fashion, and thus has a
genetic component. Bailey-Wilson, supra. Thus, it is thought that certain
allelic variants at some genetic loci may affect susceptibility to lung cancer. Id. One way
to identify which allelic variants are likely to be involved in lung cancer susceptibility, as
well as susceptibility to other diseases, is to look at allelic variants of genes that are
highly expressed in lung.

The lung is susceptible to a number of other debilitating diseases as well,
including, without limitation, emphysema, pneumonia, cystic fibrosis and asthma. See
Stockley (ed.), Molecular Biology of the Lung, Volume I: Emphysema and Infection,
Birkhauser Verlag (1999), hereafter Stockley I, and Stockley (ed.), Molecular Biology of
the Lung, Volume II: Asthma and Cancer, Birkhauser Verlag (1999), hereafter Stockley
II. The cause of many of these disorders is still not well understood and there are few, if
any, good treatment options for many of these noncancerous lung disorders. Thus, there
also remains a need for understanding of various noncancerous lung disorders and for
identifying treatments for these diseases.

The development and differentiation of the lung tissue during embryonic
development is also very important. All of the epithelial cells of the respiratory tract,
including those of the lung and bronchi, are derived from the primitive endodermal cells
that line the embryonic outpouching. See Yesner, supra. During embryonic
development, multipotent endodermal stem cells differentiate into many different types
of specialized cells, which include ciliated cells for moving inhaled particles, goblet cells
for producing mucus, Kulchitsky's cells for endocrine function, and Clara cells and type
II pneumocytes for secreting surfactant protein. Id. Improper development and
differentiation may cause respiratory disorders and distress in infants, particularly in
premature infants, whose lungs cannot produce sufficient surfactant when they are born.
Further, some lung cancer cells, particularly small cell carcinomas, appear multipotent,
and can spontaneously differentiate into a number of cell types, including small cell
carcinoma, adenocarcinoma and squamous cell carcinoma. Id. Thus, a better
understanding of lung development and differentiation may help facilitate understanding
of lung cancer initiation and progression.

Accordingly, there is a great need for more sensitive and accurate methods for
predicting whether a person is likely to develop lung cancer, for diagnosing lung cancer,
for monitoring the progression of the disease, for staging the lung cancer, for determining
whether the lung cancer has metastasized and for imaging the lung cancer. There is also
a need for better treatment of lung cancer. There is also a great need for diagnosing and
treating noncancerous lung disorders such as emphysema, pneumonia, lung infection,
pulmonary fibrosis, cystic fibrosis and asthma. There is also a need for compositions and
methods of using compositions that are capable of identifying lung tissue for forensic
purposes and for determining whether a particular cell or tissue exhibits lung-specific
characteristics.



SUMMARY OF THE INVENTION 
The present invention solves these and other needs in the art by providing nucleic
acid molecules and polypeptides, as well as antibodies, agonists and antagonists thereto,
that may be used to identify, diagnose, monitor, stage, image and treat lung cancer and
non-cancerous disease states in lung; identify and monitor lung tissue; and identify and
design agonists and antagonists of polypeptides of the invention. The invention also
provides gene therapy, methods for producing transgenic animals and cells, and methods
for producing engineered lung tissue for treatment and research.

Accordingly, one object of the invention is to provide nucleic acid molecules that
are specific to lung cells, lung tissue and/or the lung organ. These lung specific nucleic
acids (LSNAs) may be a naturally-occurring cDNA, genomic DNA, RNA, or a fragment
of one of these nucleic acids, or may be a non-naturally-occurring nucleic acid molecule.
If the LSNA is genomic DNA, then the LSNA is a lung specific gene (LSG). In a
preferred embodiment, the nucleic acid molecule encodes a polypeptide that is specific to
lung. In a more preferred embodiment, the nucleic acid molecule encodes a polypeptide
that comprises an amino acid sequence of SEQ ID NO: 165 through 284. In another
highly preferred embodiment, the nucleic acid molecule comprises a nucleic acid
sequence of SEQ ID NO: 1 through 164. The term nucleic acid molecule is also meant to
include sequences that selectively hybridize or exhibit substantial sequence
similarity to a nucleic acid molecule encoding an LSP, or that selectively hybridize or
exhibit substantial sequence similarity to an LSNA, as well as allelic variants of a nucleic
acid molecule encoding an LSP, and allelic variants of an LSNA. Nucleic acid
molecules comprising a part of a nucleic acid sequence that encodes an LSP or that
comprises a part of a nucleic acid sequence of an LSNA are also provided.

A related object of the present invention is to provide a nucleic acid molecule
comprising one or more expression control sequences controlling the transcription and/or
translation of all or a part of an LSNA. In a preferred embodiment, the nucleic acid
molecule comprises one or more expression control sequences controlling the
transcription and/or translation of a nucleic acid molecule that encodes all or a fragment
of an LSP.

Another object of the invention is to provide vectors and/or host cells comprising
a nucleic acid molecule of the instant invention. In a preferred embodiment, the nucleic
acid molecule encodes all or a fragment of an LSP. In another preferred embodiment,
the nucleic acid molecule comprises all or a part of an LSNA.

Another object of the invention is to provide methods for using the vectors and
host cells comprising a nucleic acid molecule of the instant invention to recombinantly
produce polypeptides of the invention.

Another object of the invention is to provide a polypeptide encoded by a nucleic
acid molecule of the invention. In a preferred embodiment, the polypeptide is an LSP.
The polypeptide may comprise either a fragment or a full-length protein as well as a
mutant protein (mutein), fusion protein, homologous protein or a polypeptide encoded by
an allelic variant of an LSP.

Another object of the invention is to provide an antibody that specifically binds to
a polypeptide of the instant invention.

Another object of the invention is to provide agonists and antagonists of the 
nucleic acid molecules and polypeptides of the instant invention. 

Another object of the invention is to provide methods for using the nucleic acid
molecules to detect or amplify nucleic acid molecules that have similar or identical
nucleic acid sequences compared to the nucleic acid molecules described herein. In a
preferred embodiment, the invention provides methods of using the nucleic acid
molecules of the invention for identifying, diagnosing, monitoring, staging, imaging and
treating lung cancer and non-cancerous disease states in lung. In another preferred
embodiment, the invention provides methods of using the nucleic acid molecules of the
invention for identifying and/or monitoring lung tissue. The nucleic acid molecules of
the instant invention may also be used in gene therapy, for producing transgenic animals
and cells, and for producing engineered lung tissue for treatment and research.

The polypeptides and/or antibodies of the instant invention may also be used to
identify, diagnose, monitor, stage, image and treat lung cancer and non-cancerous disease
states in lung. The invention provides methods of using the polypeptides of the invention
to identify and/or monitor lung tissue, and to produce engineered lung tissue.

The agonists and antagonists of the instant invention may be used to treat lung
cancer and non-cancerous disease states in lung and to produce engineered lung tissue.

Yet another object of the invention is to provide a computer readable means of
storing the nucleic acid and amino acid sequences of the invention. The records of the
computer readable means can be accessed to read and display the sequences and to
compare, align and order the sequences of the invention relative to other sequences.

DETAILED DESCRIPTION OF THE INVENTION 
Definitions and General Techniques 

Unless otherwise defined herein, scientific and technical terms used in connection
with the present invention shall have the meanings that are commonly understood by
those of ordinary skill in the art. Further, unless otherwise required by context, singular
terms shall include pluralities and plural terms shall include the singular. Generally,
nomenclatures used in connection with, and techniques of, cell and tissue culture,
molecular biology, immunology, microbiology, genetics and protein and nucleic acid
chemistry and hybridization described herein are those well-known and commonly used
in the art. The methods and techniques of the present invention are generally performed
according to conventional methods well-known in the art and as described in various
general and more specific references that are cited and discussed throughout the present
specification unless otherwise indicated. See, e.g., Sambrook et al., Molecular Cloning:
A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press (1989) and
Sambrook et al., Molecular Cloning: A Laboratory Manual, 3d ed., Cold Spring Harbor
Press (2001); Ausubel et al., Current Protocols in Molecular Biology, Greene Publishing
Associates (1992, and Supplements to 2000); Ausubel et al., Short Protocols in
Molecular Biology: A Compendium of Methods from Current Protocols in Molecular
Biology, 4th Ed., Wiley & Sons (1999); Harlow and Lane, Antibodies: A Laboratory
Manual, Cold Spring Harbor Laboratory Press (1990); and Harlow and Lane, Using
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory Press (1999); each of
which is incorporated herein by reference in its entirety.

Enzymatic reactions and purification techniques are performed according to
manufacturer's specifications, as commonly accomplished in the art or as described
herein. The nomenclatures used in connection with, and the laboratory procedures and
techniques of, analytical chemistry, synthetic organic chemistry, and medicinal and
pharmaceutical chemistry described herein are those well-known and commonly used in
the art. Standard techniques are used for chemical syntheses, chemical analyses,
pharmaceutical preparation, formulation, and delivery, and treatment of patients.

The following terms, unless otherwise indicated, shall be understood to have the
following meanings:

A "nucleic acid molecule" of this invention refers to a polymeric form of
nucleotides and includes both sense and antisense strands of RNA, cDNA, genomic
DNA, and synthetic forms and mixed polymers of the above. A nucleotide refers to a
ribonucleotide, deoxynucleotide or a modified form of either type of nucleotide. A
"nucleic acid molecule" as used herein is synonymous with "nucleic acid" and
"polynucleotide." The term "nucleic acid molecule" usually refers to a molecule of at
least 10 bases in length, unless otherwise specified. The term includes single- and
double-stranded forms of DNA. In addition, a polynucleotide may include either or both
naturally-occurring and modified nucleotides linked together by naturally-occurring
and/or non-naturally occurring nucleotide linkages.

The nucleic acid molecules may be modified chemically or biochemically or may
contain non-natural or derivatized nucleotide bases, as will be readily appreciated by
those of skill in the art. Such modifications include, for example, labels, methylation,
substitution of one or more of the naturally occurring nucleotides with an analog,
internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates,
phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g.,
phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides),
intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages
(e.g., alpha anomeric nucleic acids, etc.). The term "nucleic acid molecule" also includes
any topological conformation, including single-stranded, double-stranded, partially
duplexed, triplexed, hairpinned, circular and padlocked conformations. Also included
are synthetic molecules that mimic polynucleotides in their ability to bind to a designated
sequence via hydrogen bonding and other chemical interactions. Such molecules are
known in the art and include, for example, those in which peptide linkages substitute for
phosphate linkages in the backbone of the molecule.

A "gene" is defined as a nucleic acid molecule that comprises a nucleic acid
sequence that encodes a polypeptide and the expression control sequences that surround
the nucleic acid sequence that encodes the polypeptide. For instance, a gene may
comprise a promoter, one or more enhancers, a nucleic acid sequence that encodes a
polypeptide, downstream regulatory sequences and, possibly, other nucleic acid
sequences involved in regulation of the expression of an RNA. As is well-known in the
art, eukaryotic genes usually contain both exons and introns. The term "exon" refers to a
nucleic acid sequence found in genomic DNA that is bioinformatically predicted and/or
experimentally confirmed to contribute a contiguous sequence to a mature mRNA
transcript. The term "intron" refers to a nucleic acid sequence found in genomic DNA
that is predicted and/or confirmed to not contribute to a mature mRNA transcript, but
rather to be "spliced out" during processing of the transcript.

A nucleic acid molecule or polypeptide is "derived" from a particular species if 
the nucleic acid molecule or polypeptide has been isolated from the particular species, or 
if the nucleic acid molecule or polypeptide is homologous to a nucleic acid molecule or 
polypeptide isolated from a particular species. 

An "isolated" or "substantially pure" nucleic acid or polynucleotide (e.g., an
RNA, DNA or a mixed polymer) is one which is substantially separated from other
cellular components that naturally accompany the native polynucleotide in its natural
host cell, e.g., ribosomes, polymerases, or genomic sequences with which it is naturally
associated. The term embraces a nucleic acid or polynucleotide that (1) has been
removed from its naturally occurring environment, (2) is not associated with all or a
portion of a polynucleotide in which the "isolated polynucleotide" is found in nature, (3)
is operatively linked to a polynucleotide which it is not linked to in nature, (4) does not
occur in nature as part of a larger sequence or (5) includes nucleotides or internucleoside
bonds that are not found in nature. The term "isolated" or "substantially pure" also can
be used in reference to recombinant or cloned DNA isolates, chemically synthesized
polynucleotide analogs, or polynucleotide analogs that are biologically synthesized by
heterologous systems. The term "isolated nucleic acid molecule" includes nucleic acid
molecules that are integrated into a host cell chromosome at a heterologous site,
recombinant fusions of a native fragment to a heterologous sequence, and recombinant
vectors present as episomes or as integrated into a host cell chromosome.

A "part" of a nucleic acid molecule refers to a nucleic acid molecule that
comprises a partial contiguous sequence of at least 10 bases of the reference nucleic acid
molecule. Preferably, a part comprises at least 15 to 20 bases of a reference nucleic acid
molecule. In theory, a nucleic acid sequence of 17 nucleotides is of sufficient length to
occur at random less frequently than once in the three gigabase human genome, and thus
to provide a nucleic acid probe that can uniquely identify the reference sequence in a
nucleic acid mixture of genomic complexity. A preferred part is one that comprises a
nucleic acid sequence that can encode at least 6 contiguous amino acids
(fragments of at least 18 nucleotides) because such parts are useful in directing the expression
or synthesis of peptides that are useful in mapping the epitopes of the polypeptide
encoded by the reference nucleic acid. See, e.g., Geysen et al., Proc. Natl. Acad. Sci.
USA 81:3998-4002 (1984); and United States Patent Nos. 4,708,871 and 5,595,915, the
disclosures of which are incorporated herein by reference in their entireties. A part may
also comprise at least 25, 30, 35 or 40 nucleotides of a reference nucleic acid molecule,
or at least 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 500 nucleotides of a
reference nucleic acid molecule. A part of a nucleic acid molecule may comprise no other
nucleic acid sequences. Alternatively, a part of a nucleic acid may comprise other
nucleic acid sequences from other nucleic acid molecules.
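
The length argument given above for a 17 nucleotide probe can be checked with a short calculation. The following Python sketch is illustrative only and is not part of the disclosed methods. It assumes a simplified model in which the four bases occur independently and with equal frequency, and in which both strands of a three gigabase genome are searched. The function names are hypothetical. Real genomes contain repeats and biased base composition, so the uniqueness of an actual probe would still need to be confirmed empirically.

```python
# Illustrative sketch: expected chance occurrences of one specific sequence
# in a genome, under an ASSUMED uniform, independent-base model.

HUMAN_GENOME_BASES = 3_000_000_000  # the "three gigabase" figure from the text


def expected_random_matches(probe_length: int,
                            genome_bases: int = HUMAN_GENOME_BASES,
                            strands: int = 2) -> float:
    """Expected number of chance matches to one specific probe sequence."""
    if probe_length < 1:
        raise ValueError("probe_length must be at least 1")
    # Approximate the number of start positions by genome size times strands;
    # each position matches a given sequence with probability (1/4) ** length.
    return genome_bases * strands / 4 ** probe_length


def shortest_unique_length(genome_bases: int = HUMAN_GENOME_BASES,
                           strands: int = 2) -> int:
    """Smallest length whose expected number of chance matches is below one."""
    length = 1
    while expected_random_matches(length, genome_bases, strands) >= 1:
        length += 1
    return length


if __name__ == "__main__":
    for n in (15, 16, 17, 18):
        print(n, expected_random_matches(n))
    print("shortest length expected to be unique:", shortest_unique_length())
```

Under these assumptions the expected number of chance matches first falls below one at a length of 17 nucleotides, which agrees with the figure stated above.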

The term "oligonucleotide" refers to a nucleic acid molecule generally
comprising a length of 200 bases or fewer. The term often refers to single-stranded
deoxyribonucleotides, but it can refer as well to single- or double-stranded
ribonucleotides, RNA:DNA hybrids and double-stranded DNAs, among others.
Preferably, oligonucleotides are 10 to 60 bases in length and most preferably 12, 13, 14,
15, 16, 17, 18, 19 or 20 bases in length. Other preferred oligonucleotides are 25, 30, 35,
40, 45, 50, 55 or 60 bases in length. Oligonucleotides may be single-stranded, e.g., for
use as probes or primers, or may be double-stranded, e.g., for use in the construction of a
mutant gene. Oligonucleotides of the invention can be either sense or antisense
oligonucleotides. An oligonucleotide can be derivatized or modified as discussed above
for nucleic acid molecules.

Oligonucleotides, such as single-stranded DNA probe oligonucleotides, often are
synthesized by chemical methods, such as those implemented on automated
oligonucleotide synthesizers. However, oligonucleotides can be made by a variety of
other methods, including in vitro recombinant DNA-mediated techniques and by
expression of DNAs in cells and organisms. Initially, chemically synthesized DNAs
typically are obtained without a 5' phosphate. The 5' ends of such oligonucleotides are
not substrates for phosphodiester bond formation by ligation reactions that employ DNA
ligases typically used to form recombinant DNA molecules. Where ligation of such
oligonucleotides is desired, a phosphate can be added by standard techniques, such as
those that employ a kinase and ATP. The 3' end of a chemically synthesized
oligonucleotide generally has a free hydroxyl group and, in the presence of a ligase, such
as T4 DNA ligase, readily will form a phosphodiester bond with a 5' phosphate of
another polynucleotide, such as another oligonucleotide. As is well-known, this reaction
can be prevented selectively, where desired, by removing the 5' phosphates of the other
polynucleotide(s) prior to ligation.

The term "naturally-occurring nucleotide" referred to herein includes naturally-occurring
deoxyribonucleotides and ribonucleotides. The term "modified nucleotides"
referred to herein includes nucleotides with modified or substituted sugar groups and the
like. The term "nucleotide linkages" referred to herein includes nucleotide linkages
such as phosphorothioate, phosphorodithioate, phosphoroselenoate,
phosphorodiselenoate, phosphoroanilothioate, phosphoraniladate, phosphoroamidate, and
the like. See, e.g., LaPlanche et al., Nucl. Acids Res. 14:9081-9093 (1986); Stein et al.,
Nucl. Acids Res. 16:3209-3221 (1988); Zon et al., Anti-Cancer Drug Design 6:539-568
(1991); Zon et al., in Eckstein (ed.), Oligonucleotides and Analogues: A Practical
Approach, pp. 87-108, Oxford University Press (1991); United States Patent No.
5,151,510; Uhlmann and Peyman, Chemical Reviews 90:543 (1990), the disclosures of
which are hereby incorporated by reference.

Unless specified otherwise, the left hand end of a polynucleotide sequence in
sense orientation is the 5' end and the right hand end of the sequence is the 3' end. In
addition, the left hand direction of a polynucleotide sequence in sense orientation is
referred to as the 5' direction, while the right hand direction of the polynucleotide
sequence is referred to as the 3' direction. Further, unless otherwise indicated, each
nucleotide sequence is set forth herein as a sequence of deoxyribonucleotides. It is
intended, however, that the given sequence be interpreted as would be appropriate to the
polynucleotide composition: for example, if the isolated nucleic acid is composed of
RNA, the given sequence intends ribonucleotides, with uridine substituted for thymidine.
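
The convention just described, in which a sequence written as deoxyribonucleotides is read as RNA by substituting uridine for thymidine, can be sketched in Python as follows. This is an illustrative helper only; the function name and the restriction to the letters A, C, G, T and N are assumptions made for the example and are not taken from the specification.

```python
def as_rna(dna_sequence: str) -> str:
    """Read a sequence written as deoxyribonucleotides as the equivalent RNA."""
    allowed = set("ACGTN")  # assumed alphabet for this sketch
    sequence = dna_sequence.upper()
    unexpected = set(sequence) - allowed
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    # Uridine (U) replaces thymidine (T); the order of residues and the
    # 5'-to-3' left-to-right orientation are unchanged.
    return sequence.replace("T", "U")


if __name__ == "__main__":
    print(as_rna("ATGCTT"))  # prints AUGCUU
```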

The term "allelic variant" refers to one of two or more alternative naturally-
occurring forms of a gene, wherein each gene possesses a unique nucleotide sequence. 
In a preferred embodiment, different alleles of a given gene have similar or identical 
biological properties. 

The term "percent sequence identity" in the context of nucleic acid sequences
refers to the residues in two sequences which are the same when aligned for maximum
correspondence. The length of sequence identity comparison may be over a stretch of at
least about nine nucleotides, usually at least about 20 nucleotides, more usually at least
about 24 nucleotides, typically at least about 28 nucleotides, more typically at least about
32 nucleotides, and preferably at least about 36 or more nucleotides. There are a number
of different algorithms known in the art which can be used to measure nucleotide 
sequence identity. For instance, polynucleotide sequences can be compared using 
FASTA, Gap or Bestfit, which are programs in Wisconsin Package Version 10.0, 
Genetics Computer Group (GCG), Madison, Wisconsin. FASTA, which includes, e.g.,
the programs FASTA2 and FASTA3, provides alignments and percent sequence identity
of the regions of the best overlap between the query and search sequences (Pearson,
Methods Enzymol. 183: 63-98 (1990); Pearson, Methods Mol. Biol. 132: 185-219 (2000);
Pearson, Methods Enzymol. 266: 227-258 (1996); Pearson, J. Mol. Biol. 276: 71-84
(1998); herein incorporated by reference). Unless otherwise specified, default
parameters for a particular program or algorithm are used. For instance, percent
sequence identity between nucleic acid sequences can be determined using FASTA with
its default parameters (a word size of 6 and the NOPAM factor for the scoring matrix) or
using Gap with its default parameters as provided in GCG Version 6.1, herein 
incorporated by reference. 
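
By way of illustration only, the notion of percent sequence identity over an aligned region can be sketched as follows. This is a minimal sketch that assumes the two sequences have already been aligned to equal length, with "-" marking a gap; it does not reproduce the alignment or scoring performed by FASTA, Gap or Bestfit, and the function name percent_identity is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns in which both sequences carry the same residue.

    Assumption: the inputs are pre-aligned strings of equal length and '-'
    denotes a gap. Columns where both sequences have a gap are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Keep every column except those that are a gap in both sequences.
    columns = [
        (x.upper(), y.upper())
        for x, y in zip(aligned_a, aligned_b)
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column counts as identical only if both residues match and are not gaps.
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)
```

For example, under these assumptions two aligned eight-residue sequences that differ at one position score 87.5 percent.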

A reference to a nucleic acid sequence encompasses its complement unless
otherwise specified. Thus, a reference to a nucleic acid molecule having a particular
sequence should be understood to encompass its complementary strand, with its 
complementary sequence. The complementary strand is also useful, e.g., for antisense 
therapy, hybridization probes and PCR primers. 
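
As an illustrative sketch, the complementary strand of a DNA sequence written in the 5' to 3' direction can be derived as shown below. The sketch assumes an unambiguous DNA alphabet (A, C, G, T); the function name reverse_complement is hypothetical and forms no part of the disclosure.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'.

    Assumption: `seq` is DNA written 5' to 3' using only A, C, G and T.
    """
    # Complement each base, then reverse so the result also reads 5' to 3'.
    return seq.translate(_COMPLEMENT)[::-1]
```

Under these assumptions, reverse_complement("ATGC") yields "GCAT".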

In the molecular biology art, researchers use the terms "percent sequence
identity", "percent sequence similarity" and "percent sequence homology"
interchangeably. In this application, these terms shall have the same meaning with
respect to nucleic acid sequences only. 

The term "substantial similarity" or "substantial sequence similarity," when 
referring to a nucleic acid or fragment thereof, indicates that, when optimally aligned 
with appropriate nucleotide insertions or deletions with another nucleic acid (or its
complementary strand), there is nucleotide sequence identity in at least about 50%, more
preferably 60% of the nucleotide bases, usually at least about 70%, more usually at least
about 80%, preferably at least about 90%, and more preferably at least about 95-98% of
the nucleotide bases, as measured by any well-known algorithm of sequence identity,
such as FASTA, BLAST or Gap, as discussed above.

Alternatively, substantial similarity exists when a nucleic acid or fragment thereof 
hybridizes to another nucleic acid, to a strand of another nucleic acid, or to the 
complementary strand thereof, under selective hybridization conditions. Typically, 
selective hybridization will occur when there is at least about 55% sequence identity,
preferably at least about 65%, more preferably at least about 75%, and most preferably at
least about 90% sequence identity, over a stretch of at least about 14 nucleotides, more 
preferably at least 17 nucleotides, even more preferably at least 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90 or 100 nucleotides. 

Nucleic acid hybridization will be affected by such conditions as salt
concentration, temperature, solvents, the base composition of the hybridizing species,
length of the complementary regions, and the number of nucleotide base mismatches
between the hybridizing nucleic acids, as will be readily appreciated by those skilled in
the art. "Stringent hybridization conditions" and "stringent wash conditions" in the
context of nucleic acid hybridization experiments depend upon a number of different
physical parameters. The most important parameters include temperature of
hybridization, base composition of the nucleic acids, salt concentration and length of the
nucleic acid. One having ordinary skill in the art knows how to vary these parameters to
achieve a particular stringency of hybridization. In general, "stringent hybridization" is
performed at about 25°C below the thermal melting point (Tm) for the specific DNA
hybrid under a particular set of conditions. "Stringent washing" is performed at
temperatures about 5°C lower than the Tm for the specific DNA hybrid under a particular
set of conditions. The Tm is the temperature at which 50% of the target sequence
hybridizes to a perfectly matched probe. See Sambrook (1989), supra, p. 9.51, hereby
incorporated by reference.

The Tm for a particular DNA-DNA hybrid can be estimated by the formula:
Tm = 81.5°C + 16.6(log10[Na+]) + 0.41(fraction G + C) - 0.63(% formamide) - (600/l)
where l is the length of the hybrid in base pairs.

The Tm for a particular RNA-RNA hybrid can be estimated by the formula:
Tm = 79.8°C + 18.5(log10[Na+]) + 0.58(fraction G + C) + 11.8(fraction G + C)² - 0.35
(% formamide) - (820/l).

The Tm for a particular RNA-DNA hybrid can be estimated by the formula:
Tm = 79.8°C + 18.5(log10[Na+]) + 0.58(fraction G + C) + 11.8(fraction G + C)² - 0.50
(% formamide) - (820/l).
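
The three estimates above may be transcribed into code as follows. This is a minimal sketch, not a validated calculator: the function and parameter names are hypothetical, [Na+] is assumed to be given in mol/l, and the G + C term is entered exactly as the text words it. The cited reference should be consulted as to whether that term is meant as a fraction or as a percentage in the DNA-DNA formula.

```python
import math


def tm_dna_dna(na_molar: float, gc_fraction: float,
               formamide_pct: float, length_bp: int) -> float:
    """Estimated Tm (deg C) of a DNA-DNA hybrid, per the formula in the text."""
    return (81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_fraction
            - 0.63 * formamide_pct - 600.0 / length_bp)


def _tm_rna(na_molar: float, gc_fraction: float, formamide_pct: float,
            length_bp: int, formamide_coeff: float) -> float:
    # Shared form of the RNA-RNA and RNA-DNA estimates; only the
    # formamide coefficient differs between the two.
    return (79.8 + 18.5 * math.log10(na_molar) + 0.58 * gc_fraction
            + 11.8 * gc_fraction ** 2
            - formamide_coeff * formamide_pct - 820.0 / length_bp)


def tm_rna_rna(na_molar: float, gc_fraction: float,
               formamide_pct: float, length_bp: int) -> float:
    """Estimated Tm (deg C) of an RNA-RNA hybrid (formamide coefficient 0.35)."""
    return _tm_rna(na_molar, gc_fraction, formamide_pct, length_bp, 0.35)


def tm_rna_dna(na_molar: float, gc_fraction: float,
               formamide_pct: float, length_bp: int) -> float:
    """Estimated Tm (deg C) of an RNA-DNA hybrid (formamide coefficient 0.50)."""
    return _tm_rna(na_molar, gc_fraction, formamide_pct, length_bp, 0.50)
```

Because the RNA-RNA and RNA-DNA estimates differ only in the formamide coefficient, at 50% formamide the two results differ by 7.5°C for otherwise identical inputs.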

In general, the Tm decreases by 1-1.5°C for each 1% of mismatch between two
nucleic acid sequences. Thus, one having ordinary skill in the art can alter hybridization
and/or washing conditions to obtain sequences that have higher or lower degrees of
sequence identity to the target nucleic acid. For instance, to obtain hybridizing nucleic
acids that contain up to 10% mismatch from the target nucleic acid sequence, 10-15°C
would be subtracted from the calculated Tm of a perfectly matched hybrid, and then the
hybridization and washing temperatures adjusted accordingly. Probe sequences may also
hybridize specifically to duplex DNA under certain conditions to form triplex or other
higher order DNA complexes. The preparation of such probes and suitable hybridization
conditions are well-known in the art. 
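
The mismatch adjustment described above can be illustrated with a short sketch. It assumes the rule of thumb stated in the text (a decrease of 1 to 1.5°C per 1% mismatch) and returns the resulting range; the function name mismatch_adjusted_tm is hypothetical.

```python
def mismatch_adjusted_tm(tm_perfect: float, mismatch_pct: float) -> tuple[float, float]:
    """Return the (low, high) Tm estimate in deg C for a mismatched hybrid.

    Assumption: Tm falls by 1 to 1.5 deg C for each 1% of mismatch, applied
    linearly to the Tm calculated for a perfectly matched hybrid.
    """
    if mismatch_pct < 0:
        raise ValueError("mismatch percentage cannot be negative")
    low = tm_perfect - 1.5 * mismatch_pct   # largest expected decrease
    high = tm_perfect - 1.0 * mismatch_pct  # smallest expected decrease
    return low, high
```

For a perfectly matched Tm of 80°C and a tolerated mismatch of 10%, this gives a range of 65 to 70°C, matching the 10-15°C subtraction in the example above.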

An example of stringent hybridization conditions for hybridization of 
complementary nucleic acid sequences having more than 100 complementary residues on 
a filter in a Southern or Northern blot or for screening a library is 50% formamide/6X
SSC at 42°C for at least ten hours and preferably overnight (approximately 16 hours).
Another example of stringent hybridization conditions is 6X SSC at 68°C without 
formamide for at least ten hours and preferably overnight. An example of moderate 
stringency hybridization conditions is 6X SSC at 55°C without formamide for at least ten 
hours and preferably overnight. An example of low stringency hybridization conditions
for hybridization of complementary nucleic acid sequences having more than 100
complementary residues on a filter in a Southern or Northern blot or for screening a
library is 6X SSC at 42°C for at least ten hours. Hybridization conditions to identify
nucleic acid sequences that are similar but not identical can be identified by
experimentally changing the hybridization temperature from 68°C to 42°C while keeping
the salt concentration constant (6X SSC), or keeping the hybridization temperature and
salt concentration constant (e.g., 42°C and 6X SSC) and varying the formamide
concentration from 50% to 0%. Hybridization buffers may also include blocking agents
to lower background. These agents are well-known in the art. See Sambrook et al. 
(1989), supra, pages 8.46 and 9.46-9.58, herein incorporated by reference. See also 
Ausubel (1992), supra, Ausubel (1999), supra, and Sambrook (2001), supra. 

Wash conditions also can be altered to change stringency conditions. An example
of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes (see Sambrook
(1989), supra, for SSC buffer). Often the high stringency wash is preceded by a low
stringency wash to remove excess probe. An exemplary medium stringency wash for
duplex DNA of more than 100 base pairs is 1x SSC at 45°C for 15 minutes. An
exemplary low stringency wash for such a duplex is 4x SSC at 40°C for 15 minutes. In
general, a signal-to-noise ratio of 2x or higher relative to that observed for an unrelated probe in
the particular hybridization assay indicates detection of a specific hybridization.

As defined herein, nucleic acid molecules that do not hybridize to each other 
under stringent conditions are still substantially similar to one another if they encode 
polypeptides that are substantially identical to each other. This occurs, for example,
when a nucleic acid molecule is created synthetically or recombinantly using high codon
degeneracy as permitted by the redundancy of the genetic code. 

Hybridization conditions for nucleic acid molecules that are shorter than 100 
nucleotides in length (e.g., for oligonucleotide probes) may be calculated by the formula:
Tm = 81.5°C + 16.6(log10[Na+]) + 0.41(fraction G + C) - (600/N),
wherein N is chain length and the [Na+] is 1 M or less. See Sambrook (1989), supra, p.
11.46. For hybridization of probes shorter than 100 nucleotides, hybridization is usually
performed under stringent conditions (5-10°C below the Tm) using high concentrations
(0.1-1.0 pmol/ml) of probe. Id. at p. 11.45. Determination of hybridization using
mismatched probes, pools of degenerate probes or "guessmers," as well as hybridization
solutions and methods for empirically determining hybridization conditions are well-known
in the art. See, e.g., Ausubel (1999), supra; Sambrook (1989), supra, pp. 11.45-11.57.

The term "digestion" or "digestion of DNA" refers to catalytic cleavage of the
DNA with a restriction enzyme that acts only at certain sequences in the DNA. The
various restriction enzymes referred to herein are commercially available and their
reaction conditions, cofactors and other requirements for use are known and routine to
the skilled artisan. For analytical purposes, typically, 1 µg of plasmid or DNA fragment
is digested with about 2 units of enzyme in about 20 µl of reaction buffer. For the
purpose of isolating DNA fragments for plasmid construction, typically 5 to 50 µg of
DNA are digested with 20 to 250 units of enzyme in proportionately larger volumes.
Appropriate buffers and substrate amounts for particular restriction enzymes are
described in standard laboratory manuals, such as those referenced below, and they are
specified by commercial suppliers. Incubation times of about 1 hour at 37°C are
ordinarily used, but conditions may vary in accordance with standard procedures, the
supplier's instructions and the particulars of the reaction. After digestion, reactions may
be analyzed, and fragments may be purified by electrophoresis through an agarose or
polyacrylamide gel, using well-known methods that are routine for those skilled in the
art. 

The term "ligation" refers to the process of forming phosphodiester bonds 
between two or more polynucleotides, which most often are double-stranded DNAs.
Techniques for ligation are well-known to the art and protocols for ligation are described
in standard laboratory manuals and references, such as, e.g., Sambrook (1989), supra.

Genome-derived "single exon probes" are probes that comprise at least part of an
exon ("reference exon") and can hybridize detectably under high stringency conditions to
transcript-derived nucleic acids that include the reference exon but do not hybridize
detectably under high stringency conditions to nucleic acids that lack the reference exon.
Single exon probes typically further comprise, contiguous to a first end of the exon
portion, a first intronic and/or intergenic sequence that is identically contiguous to the
exon in the genome, and may contain a second intronic and/or intergenic sequence that is
identically contiguous to the exon in the genome. The minimum length of genome-derived
single exon probes is defined by the requirement that the exonic portion be of
sufficient length to hybridize under high stringency conditions to transcript-derived
nucleic acids, as discussed above. The maximum length of genome-derived single exon
probes is defined by the requirement that the probes contain portions of no more than one
exon. The single exon probes may contain priming sequences not found in contiguity
with the rest of the probe sequence in the genome, which priming sequences are useful 
for PCR and other amplification-based technologies. 

The term "microarray" or "nucleic acid microarray" refers to a substrate-bound 
collection of plural nucleic acids, hybridization to each of the plurality of bound nucleic
acids being separately detectable. The substrate can be solid or porous, planar or
non-planar, unitary or distributed. Microarrays or nucleic acid microarrays include all the
devices so called in Schena (ed.), DNA Microarrays: A Practical Approach (Practical
Approach Series), Oxford University Press (1999); Nature Genet. 21(1)(suppl.):1-60
(1999); Schena (ed.), Microarray Biochip: Tools and Technology, Eaton Publishing
Company/BioTechniques Books Division (2000). These microarrays include substrate-bound
collections of plural nucleic acids in which the plurality of nucleic acids are
disposed on a plurality of beads, rather than on a unitary planar substrate, as is described,
inter alia, in Brenner et al., Proc. Natl. Acad. Sci. USA 97(4):1665-1670 (2000).

The term "mutated" when applied to nucleic acid molecules means that
nucleotides in the nucleic acid sequence of the nucleic acid molecule may be inserted,
deleted or changed compared to a reference nucleic acid sequence. A single alteration
may be made at a locus (a point mutation) or multiple nucleotides may be inserted,
deleted or changed at a single locus. In addition, one or more alterations may be made at
any number of loci within a nucleic acid sequence. In a preferred embodiment, the
nucleic acid molecule comprises the wild type nucleic acid sequence encoding an LSP or
is an LSNA. The nucleic acid molecule may be mutated by any method known in the art 
including those mutagenesis techniques described infra. 

The term "error-prone PCR" refers to a process for performing PCR under
conditions where the copying fidelity of the DNA polymerase is low, such that a high
rate of point mutations is obtained along the entire length of the PCR product. See, e.g.,
Leung et al., Technique 1:11-15 (1989) and Caldwell et al., PCR Methods Applic. 2:28-33
(1992).

The term "oligonucleotide-directed mutagenesis" refers to a process which
enables the generation of site-specific mutations in any cloned DNA segment of interest.
See, e.g., Reidhaar-Olson et al., Science 241: 53-57 (1988).



WO 02/068633 PCT/US01/43612 

The term "assembly PCR" refers to a process which involves the assembly of a
PCR product from a mixture of small DNA fragments. A large number of different PCR
reactions occur in parallel in the same vial, with the products of one reaction priming the
products of another reaction.

The term "sexual PCR mutagenesis" or "DNA shuffling" refers to a method of
error-prone PCR coupled with forced homologous recombination between DNA
molecules of different but highly related DNA sequence in vitro, caused by random
fragmentation of the DNA molecule based on sequence similarity, followed by fixation
of the crossover by primer extension in an error-prone PCR reaction. See, e.g., Stemmer,
Proc. Natl. Acad. Sci. U.S.A. 91: 10747-10751 (1994). DNA shuffling can be carried out
between several related genes ("family shuffling").

The term "in vivo mutagenesis" refers to a process of generating random
mutations in any cloned DNA of interest which involves the propagation of the DNA in a
strain of bacteria such as E. coli that carries mutations in one or more of the DNA repair
pathways. These "mutator" strains have a higher random mutation rate than that of a
wild-type parent. Propagating the DNA in a mutator strain will eventually generate 
random mutations within the DNA. 

The term "cassette mutagenesis" refers to any process for replacing a small region 
of a double-stranded DNA molecule with a synthetic oligonucleotide "cassette" that
differs from the native sequence. The oligonucleotide often contains completely and/or
partially randomized native sequence. 

The term "recursive ensemble mutagenesis" refers to an algorithm for protein 
engineering (protein mutagenesis) developed to produce diverse populations of 
phenotypically related mutants whose members differ in amino acid sequence. This
method uses a feedback mechanism to control successive rounds of combinatorial
cassette mutagenesis. See, e.g., Arkin et al., Proc. Natl. Acad. Sci. U.S.A. 89: 7811-7815
(1992). 

The term "exponential ensemble mutagenesis" refers to a process for generating 
combinatorial libraries with a high percentage of unique and functional mutants, wherein 
small groups of residues are randomized in parallel to identify, at each altered position,
amino acids which lead to functional proteins. See, e.g., Delegrave et al., Biotechnology
Research 11: 1548-1552 (1993); Arnold, Current Opinion in Biotechnology 4: 450-455
(1993). Each of the references mentioned above is hereby incorporated by reference in
its entirety.

"Operatively linked" expression control sequences refers to a linkage in which the 
expression control sequence is contiguous with the gene of interest to control the gene of 
interest, as well as expression control sequences that act in trans or at a distance to
control the gene of interest. 

The term "expression control sequence" as used herein refers to polynucleotide 
sequences which are necessary to affect the expression of coding sequences to which
they are operatively linked. Expression control sequences are sequences which control
the transcription, post-transcriptional events and translation of nucleic acid sequences.
Expression control sequences include appropriate transcription initiation, termination,
promoter and enhancer sequences; efficient RNA processing signals such as splicing and
polyadenylation signals; sequences that stabilize cytoplasmic mRNA; sequences that
enhance translation efficiency (e.g., ribosome binding sites); sequences that enhance
protein stability; and when desired, sequences that enhance protein secretion. The nature
of such control sequences differs depending upon the host organism; in prokaryotes, such
control sequences generally include the promoter, ribosomal binding site, and
transcription termination sequence. The term "control sequences" is intended to include,
at a minimum, all components whose presence is essential for expression, and can also
include additional components whose presence is advantageous, for example, leader
sequences and fusion partner sequences. 

The term "vector," as used herein, is intended to refer to a nucleic acid molecule 
capable of transporting another nucleic acid to which it has been linked. One type of 
vector is a "plasmid", which refers to a circular double-stranded DNA loop into which
additional DNA segments may be ligated. Other vectors include cosmids, bacterial
artificial chromosomes (BAC) and yeast artificial chromosomes (YAC). Another type of
vector is a viral vector, wherein additional DNA segments may be ligated into the viral
genome. Viral vectors that infect bacterial cells are referred to as bacteriophages.
Certain vectors are capable of autonomous replication in a host cell into which they are
introduced (e.g., bacterial vectors having a bacterial origin of replication). Other vectors
can be integrated into the genome of a host cell upon introduction into the host cell, and
thereby are replicated along with the host genome. Moreover, certain vectors are capable
of directing the expression of genes to which they are operatively linked. Such vectors
are referred to herein as "recombinant expression vectors" (or simply, "expression
vectors"). In general, expression vectors of utility in recombinant DNA techniques are
often in the form of plasmids. In the present specification, "plasmid" and "vector" may
be used interchangeably as the plasmid is the most commonly used form of vector.
However, the invention is intended to include other forms of expression vectors that 
serve equivalent functions. 

The term "recombinant host cell" (or simply "host cell"), as used herein, is 
intended to refer to a cell into which an expression vector has been introduced. It should
be understood that such terms are intended to refer not only to the particular subject cell
but to the progeny of such a cell. Because certain modifications may occur in succeeding 
generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term 
"host cell" as used herein. 

As used herein, the phrase "open reading frame" and the equivalent acronym
"ORF" refer to that portion of a transcript-derived nucleic acid that can be translated in
its entirety into a sequence of contiguous amino acids. As so defined, an ORF has length, 
measured in nucleotides, exactly divisible by 3. As so defined, an ORF need not encode 
the entirety of a natural protein. 
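
A minimal sketch of this definition follows. It assumes that "translated in its entirety" excludes any in-frame stop codon of the standard genetic code, and it does not require a start codon, consistent with the statement that an ORF need not encode an entire natural protein. The function name is_orf is hypothetical.

```python
# Stop codons of the standard genetic code, in DNA notation.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_orf(seq: str) -> bool:
    """Check whether `seq` satisfies the ORF definition sketched above.

    Assumptions: the reading frame starts at the first nucleotide, the
    sequence uses only A, C, G and T (or U), and no in-frame stop codon
    is permitted anywhere in the sequence.
    """
    s = seq.upper().replace("U", "T")
    # The length must be non-zero and exactly divisible by 3.
    if not s or len(s) % 3 != 0:
        return False
    if set(s) - set("ACGT"):
        return False
    codons = (s[i:i + 3] for i in range(0, len(s), 3))
    return all(codon not in STOP_CODONS for codon in codons)
```

Under these assumptions a six-nucleotide sequence such as "GCCGCC" qualifies even though it lacks an initiator codon, while "ATGTAAGCC" does not because of its in-frame stop codon.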

As used herein, the phrase "ORF-encoded peptide" refers to the predicted or
actual translation of an ORF.

As used herein, the phrase "degenerate variant" of a reference nucleic acid 
sequence intends all nucleic acid sequences that can be directly translated, using the 
standard genetic code, to provide an amino acid sequence identical to that translated from
the reference nucleic acid sequence.
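
The degenerate-variant relationship can be illustrated by translating two sequences with the standard genetic code and comparing the results. The following is a sketch only: it assumes both sequences are read in frame from their first nucleotide and have lengths divisible by 3, and the names translate and is_degenerate_variant are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame nucleotide sequence; '*' marks a stop codon."""
    s = seq.upper().replace("U", "T")
    if len(s) % 3 != 0:
        raise ValueError("sequence length must be divisible by 3")
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))


def is_degenerate_variant(candidate: str, reference: str) -> bool:
    """True if both sequences translate to the same amino acid sequence."""
    return translate(candidate) == translate(reference)
```

For instance, "ATGGCT" and "ATGGCC" both translate to Met-Ala and are therefore degenerate variants of one another under this sketch, whereas "ATGGAT" (Met-Asp) is not.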

The term "polypeptide" encompasses both naturally-occurring and non-naturally- 
occurring proteins and polypeptides, polypeptide fragments and polypeptide mutants, 
derivatives and analogs. A polypeptide may be monomeric or polymeric. Further, a
polypeptide may comprise a number of different modules within a single polypeptide
each of which has one or more distinct activities. A preferred polypeptide in accordance
with the invention comprises an LSP encoded by a nucleic acid molecule of the instant
invention, as well as a fragment, mutant, analog and derivative thereof.

The term "isolated protein" or "isolated polypeptide" refers to a protein or polypeptide
that by virtue of its origin or source of derivation (1) is not associated with naturally
associated components that accompany it in its native state, (2) is free of other proteins
from the same species, (3) is expressed by a cell from a different species, or (4) does not
occur in nature. Thus, a polypeptide that is chemically synthesized or synthesized in a
cellular system different from the cell from which it naturally originates will be 
"isolated" from its naturally associated components. A polypeptide or protein may also 
be rendered substantially free of naturally associated components by isolation, using 
protein purification techniques well-known in the art. 

A protein or polypeptide is "substantially pure," "substantially homogeneous" or
"substantially purified" when at least about 60% to 75% of a sample exhibits a single
species of polypeptide. The polypeptide or protein may be monomeric or multimeric. A
substantially pure polypeptide or protein will typically comprise about 50%, 60%, 70%,
80% or 90% W/W of a protein sample, more usually about 95%, and preferably will be
over 99% pure. Protein purity or homogeneity may be indicated by a number of means
well-known in the art, such as polyacrylamide gel electrophoresis of a protein sample, 
followed by visualizing a single polypeptide band upon staining the gel with a stain well- 
known in the art. For certain purposes, higher resolution may be provided by using 
HPLC or other means well-known in the art for purification. 

The term "polypeptide fragment" as used herein refers to a polypeptide of the
instant invention that has an amino-terminal and/or carboxy-terminal deletion compared
to a full-length polypeptide. In a preferred embodiment, the polypeptide fragment is a
contiguous sequence in which the amino acid sequence of the fragment is identical to the
corresponding positions in the naturally-occurring sequence. Fragments typically are at
least 5, 6, 7, 8, 9 or 10 amino acids long, preferably at least 12, 14, 16 or 18 amino acids
long, more preferably at least 20 amino acids long, more preferably at least 25, 30, 35, 40
or 45 amino acids, even more preferably at least 50 or 60 amino acids long, and even
more preferably at least 70 amino acids long. 
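
The preferred embodiment described above, in which a fragment is a contiguous stretch identical to the corresponding positions of the full-length sequence, can be sketched as a simple check. The sketch assumes one-letter amino acid codes; the function name, the default minimum length of 5 and the example sequences in the usage note are hypothetical and illustrative only.

```python
def is_terminal_deletion_fragment(fragment: str, full_length: str,
                                  min_length: int = 5) -> bool:
    """True if `fragment` is a shorter, contiguous stretch of `full_length`.

    Any contiguous stretch shorter than the full-length sequence corresponds
    to an amino-terminal and/or carboxy-terminal deletion. `min_length` is an
    assumed lower bound taken from the shortest length listed in the text.
    """
    frag, full = fragment.upper(), full_length.upper()
    return min_length <= len(frag) < len(full) and frag in full
```

With a hypothetical full-length sequence "MKTAYIAKQRQISFVK", the stretch "KTAYIAKQR" passes this check, while the full-length sequence itself does not because nothing has been deleted.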

A "derivative" refers to polypeptides or fragments thereof that are substantially
similar in primary structural sequence but which include, e.g., in vivo or in vitro chemical
and biochemical modifications that are not found in the native polypeptide. Such
modifications include, for example, acetylation, acylation, ADP-ribosylation, amidation,
covalent attachment of flavin, covalent attachment of a heme moiety, covalent
attachment of a nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid
derivative, covalent attachment of phosphatidylinositol, cross-linking, cyclization,
disulfide bond formation, demethylation, formation of covalent cross-links, formation of
cystine, formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation,
GPI anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation,
proteolytic processing, phosphorylation, prenylation, racemization, selenoylation,
sulfation, transfer-RNA mediated addition of amino acids to proteins such as
arginylation, and ubiquitination. Other modifications include, e.g., labeling with
radionuclides, and various enzymatic modifications, as will be readily appreciated by
those skilled in the art. A variety of methods for labeling polypeptides and of
substituents or labels useful for such purposes are well-known in the art, and include
radioactive isotopes such as 125I, 32P, 35S, and 3H, ligands which bind to labeled
antiligands (e.g., antibodies), fluorophores, chemiluminescent agents, enzymes, and
antiligands which can serve as specific binding pair members for a labeled ligand. The
choice of label depends on the sensitivity required, ease of conjugation with the primer, 
stability requirements, and available instrumentation. Methods for labeling polypeptides 
are well-known in the art. See Ausubel (1992), supra; Ausubel (1999), supra, herein 
incorporated by reference. 

The term "fusion protein" refers to polypeptides of the instant invention
comprising polypeptides or fragments coupled to heterologous amino acid sequences.
Fusion proteins are useful because they can be constructed to contain two or more
desired functional elements from two or more different proteins. A fusion protein
comprises at least 10 contiguous amino acids from a polypeptide of interest, more
preferably at least 20 or 30 amino acids, even more preferably at least 40, 50 or 60 amino
acids, yet more preferably at least 75, 100 or 125 amino acids. Fusion proteins can be
produced recombinantly by constructing a nucleic acid sequence which encodes the
polypeptide or a fragment thereof in frame with a nucleic acid sequence encoding a
different protein or peptide and then expressing the fusion protein. Alternatively, a
fusion protein can be produced chemically by crosslinking the polypeptide or a fragment
thereof to another protein.

The term "analog" refers to both polypeptide analogs and non-peptide analogs.
The term "polypeptide analog" as used herein refers to a polypeptide of the instant
invention that is comprised of a segment of at least 25 amino acids that has substantial
identity to a portion of an amino acid sequence but which contains non-natural amino
acids or non-natural inter-residue bonds. In a preferred embodiment, the analog has the
same or similar biological activity as the native polypeptide. Typically, polypeptide
analogs comprise a conservative amino acid substitution (or insertion or deletion) with
respect to the naturally-occurring sequence. Analogs typically are at least 20 amino acids
long, preferably at least 50 amino acids long or longer, and can often be as long as a
full-length naturally-occurring polypeptide.

The term "non-peptide analog" refers to a compound with properties that are 
analogous to those of a reference polypeptide of the instant invention. A non-peptide 
compound may also be termed a "peptide mimetic" or a "peptidomimetic." Such 
compounds are often developed with the aid of computerized molecular modeling.
Peptide mimetics that are structurally similar to useful peptides may be used to produce
an equivalent effect. Generally, peptidomimetics are structurally similar to a paradigm
polypeptide (i.e., a polypeptide that has a desired biochemical property or
pharmacological activity), but have one or more peptide linkages optionally replaced by a
linkage selected from the group consisting of: -CH2NH-, -CH2S-, -CH2-CH2-,
-CH=CH- (cis and trans), -COCH2-, -CH(OH)CH2-, and -CH2SO-, by methods
well-known in the art. Systematic substitution of one or more amino acids of a
consensus sequence with a D-amino acid of the same type (e.g., D-lysine in place of
L-lysine) may also be used to generate more stable peptides. In addition, constrained
peptides comprising a consensus sequence or a substantially identical consensus
sequence variation may be generated by methods known in the art (Rizo et al., Ann. Rev.
Biochem. 61:387-418 (1992), incorporated herein by reference). For example, one may
add internal cysteine residues capable of forming intramolecular disulfide bridges which 
cyclize the peptide. 

A "polypeptide mutant" or "mutein" refers to a polypeptide of the instant
invention whose sequence contains substitutions, insertions or deletions of one or more
amino acids compared to the amino acid sequence of a native or wild-type protein. A
mutein may have one or more amino acid point substitutions, in which a single amino
acid at a position has been changed to another amino acid, one or more insertions and/or
deletions, in which one or more amino acids are inserted or deleted, respectively, in the
sequence of the naturally-occurring protein, and/or truncations of the amino acid
sequence at either or both the amino or carboxy termini. Further, a mutein may have the
same or different biological activity as the naturally-occurring protein. For instance, a
mutein may have an increased or decreased biological activity. A mutein has at least
50% sequence similarity to the wild type protein, preferred is 60% sequence similarity,
more preferred is 70% sequence similarity. Even more preferred are muteins having
80%, 85% or 90% sequence similarity to the wild type protein. In an even more
preferred embodiment, a mutein exhibits 95% sequence identity, even more preferably
97%, even more preferably 98% and even more preferably 99%. Sequence similarity 
may be measured by any common sequence analysis algorithm, such as Gap or Bestfit. 

Preferred amino acid substitutions are those which: (1) reduce susceptibility to 
proteolysis, (2) reduce susceptibility to oxidation, (3) alter binding affinity for forming 

protein complexes, (4) alter binding affinity or enzymatic activity, and (5) confer or
modify other physicochemical or functional properties of such analogs. For example,
single or multiple amino acid substitutions (preferably conservative amino acid 
substitutions) may be made in the naturally-occurring sequence (preferably in the portion 
of the polypeptide outside the domain(s) forming intermolecular contacts). In a preferred

embodiment, the amino acid substitutions are moderately conservative substitutions or
conservative substitutions. In a more preferred embodiment, the amino acid substitutions 
are conservative substitutions. A conservative amino acid substitution should not 
substantially change the structural characteristics of the parent sequence (e.g., a 
replacement amino acid should not tend to disrupt a helix that occurs in the parent 

sequence, or disrupt other types of secondary structure that characterize the parent

sequence). Examples of art-recognized polypeptide secondary and tertiary structures are 
described in Creighton (ed.), Proteins, Structures and Molecular Principles, W. H.
Freeman and Company (1984); Branden et al (ed.), Introduction to Protein Structure,
Garland Publishing (1991); Thornton et al, Nature 354:105-106 (1991), each of which 

is incorporated herein by reference.

As used herein, the twenty conventional amino acids and their abbreviations 
follow conventional usage. See Golub et al (eds.), Immunology - A Synthesis 2nd Ed.,
Sinauer Associates (1991), which is incorporated herein by reference. Stereoisomers
(e.g., D-amino acids) of the twenty conventional amino acids, unnatural amino acids such 
as α,α-disubstituted amino acids, N-alkyl amino acids, and other unconventional amino
acids may also be suitable components for polypeptides of the present invention.
Examples of unconventional amino acids include: 4-hydroxyproline, γ-carboxyglutamate,
ε-N,N,N-trimethyllysine, ε-N-acetyllysine, O-phosphoserine, N-acetylserine,
N-formylmethionine, 3-methylhistidine, 5-hydroxylysine, σ-N-methylarginine, and other
similar amino acids and imino acids (e.g., 4-hydroxyproline). In the polypeptide notation
used herein, the left-hand direction is the amino-terminal direction and the right-hand
direction is the carboxy-terminal direction, in accordance with standard usage and
convention. 

A protein has "homology" or is "homologous" to a protein from another organism 
if the encoded amino acid sequence of the protein has a similar sequence to the encoded 
amino acid sequence of a protein of a different organism and has a similar biological 

activity or function. Alternatively, a protein may have homology or be homologous to
another protein if the two proteins have similar amino acid sequences and have similar 
biological activities or functions. Although two proteins are said to be "homologous," 
this does not imply that there is necessarily an evolutionary relationship between the 
proteins. Instead, the term "homologous" is defined to mean that the two proteins have 

similar amino acid sequences and similar biological activities or functions. In a preferred
embodiment, a homologous protein is one that exhibits 50% sequence similarity to the 
wild type protein, preferred is 60% sequence similarity, more preferred is 70% sequence 
similarity. Even more preferred are homologous proteins that exhibit 80%, 85% or 90% 
sequence similarity to the wild type protein. In a yet more preferred embodiment, a 

homologous protein exhibits 95%, 97%, 98% or 99% sequence similarity.

When "sequence similarity" is used in reference to proteins or peptides, it is 
recognized that residue positions that are not identical often differ by conservative amino 
acid substitutions. In a preferred embodiment, a polypeptide that has "sequence 
similarity" comprises conservative or moderately conservative amino acid substitutions. 

A "conservative amino acid substitution" is one in which an amino acid residue is
substituted by another amino acid residue having a side chain (R group) with similar 
chemical properties (e.g., charge or hydrophobicity). In general, a conservative amino 




acid substitution will not substantially change the functional properties of a protein. In 
cases where two or more amino acid sequences differ from each other by conservative 
substitutions, the percent sequence identity or degree of similarity may be adjusted 
upwards to correct for the conservative nature of the substitution. Means for making this 
adjustment are well-known to those of skill in the art. See, e.g., Pearson, Methods Mol.
Biol. 24: 307-31 (1994), herein incorporated by reference.

For instance, the following six groups each contain amino acids that are 
conservative substitutions for one another: 
1) Serine (S), Threonine (T);
2) Aspartic Acid (D), Glutamic Acid (E);
3) Asparagine (N), Glutamine (Q);
4) Arginine (R), Lysine (K);
5) Isoleucine (I), Leucine (L), Methionine (M), Alanine (A), Valine (V), and
6) Phenylalanine (F), Tyrosine (Y), Tryptophan (W).

Alternatively, a conservative replacement is any change having a positive value in

the PAM250 log-likelihood matrix disclosed in Gonnet et al, Science 256: 1443-45 
(1992), herein incorporated by reference. A "moderately conservative" replacement is 
any change having a nonnegative value in the PAM250 log-likelihood matrix. 
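By way of illustration only, the six conservative-substitution groups listed above can be expressed as a small lookup, and percent identity and percent similarity can then be counted over two sequences that have already been aligned. The function names, the use of "-" as a gap symbol, and the choice to count gapped columns in the denominator are assumptions made for this sketch; it is not the scoring performed by Gap, Bestfit or the PAM250 matrix.

```python
from typing import Tuple

# The six groups from the text above, in one-letter amino acid codes.
CONSERVATIVE_GROUPS = [
    set("ST"),     # Serine, Threonine
    set("DE"),     # Aspartic Acid, Glutamic Acid
    set("NQ"),     # Asparagine, Glutamine
    set("RK"),     # Arginine, Lysine
    set("ILMAV"),  # Isoleucine, Leucine, Methionine, Alanine, Valine
    set("FYW"),    # Phenylalanine, Tyrosine, Tryptophan
]


def is_conservative(a: str, b: str) -> bool:
    """Return True if two different residues belong to the same group."""
    a, b = a.upper(), b.upper()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def identity_and_similarity(seq1: str, seq2: str) -> Tuple[float, float]:
    """Percent identity and percent similarity of two pre-aligned sequences.

    Assumption: both strings have equal length and '-' marks a gap.
    Columns that are gaps in both sequences are ignored.
    """
    if len(seq1) != len(seq2):
        raise ValueError("aligned sequences must have equal length")
    identical = similar = compared = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == "-" or b == "-":
            continue  # a gap counts as neither identical nor similar
        if a == b:
            identical += 1
            similar += 1
        elif is_conservative(a, b):
            similar += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * identical / compared, 100.0 * similar / compared
```

For the aligned pair "STDK" and "TTDE", two of four columns are identical and a third (S to T) is conservative, so the sketch reports a similarity higher than the identity, which mirrors the upward adjustment described above.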

Sequence similarity for polypeptides, which is also referred to as sequence 

identity, is typically measured using sequence analysis software. Protein analysis
software matches similar sequences using measures of similarity assigned to various 
substitutions, deletions and other modifications, including conservative amino acid 
substitutions. For instance, GCG contains programs such as "Gap" and "Bestfit" which 
can be used with default parameters to determine sequence homology or sequence 

identity between closely related polypeptides, such as homologous polypeptides from
different species of organisms or between a wild type protein and a mutein thereof. See,
e.g., GCG Version 6.1. Other programs include FASTA, discussed supra.

A preferred algorithm when comparing a sequence of the invention to a database 
containing a large number of sequences from different organisms is the computer 

program BLAST, especially blastp or tblastn. See, e.g., Altschul et al, J. Mol. Biol. 215:
403-410 (1990); Altschul et al, Nucleic Acids Res. 25:3389-402 (1997); herein 
incorporated by reference. Preferred parameters for blastp are: 






Expectation value: 10 (default)
Filter: seg (default)
Cost to open a gap: 11 (default)
Cost to extend a gap: 1 (default)
Max. alignments: 100 (default)
Word size: 11 (default)
No. of descriptions: 100 (default)
Penalty Matrix: BLOSUM62

The length of polypeptide sequences compared for homology will generally be at 
least about 16 amino acid residues, usually at least about 20 residues, more usually at
least about 24 residues, typically at least about 28 residues, and preferably more than 
about 35 residues. When searching a database containing sequences from a large number 
of different organisms, it is preferable to compare amino acid sequences. 

Database searching using amino acid sequences can be performed with algorithms
other than blastp that are known in the art. For instance, polypeptide sequences can be
compared using FASTA, a program in GCG Version 6.1. FASTA (e.g., FASTA2 and
FASTA3) provides alignments and percent sequence identity of the regions of the best
overlap between the query and search sequences (Pearson (1990), supra; Pearson (2000),
supra). For example, percent sequence identity between amino acid sequences can be
determined using FASTA with its default or recommended parameters (a word size of 2
and the PAM250 scoring matrix), as provided in GCG Version 6.1, herein incorporated 
by reference. 

An "antibody" refers to an intact immunoglobulin, or to an antigen-binding 
portion thereof that competes with the intact antibody for specific binding to a molecular 

species, e.g., a polypeptide of the instant invention. Antigen-binding portions may be
produced by recombinant DNA techniques or by enzymatic or chemical cleavage of 
intact antibodies. Antigen-binding portions include, inter alia, Fab, Fab', F(ab')2, Fv, 
dAb, and complementarity determining region (CDR) fragments, single-chain antibodies 
(scFv), chimeric antibodies, diabodies and polypeptides that contain at least a portion of 

an immunoglobulin that is sufficient to confer specific antigen binding to the

polypeptide. An Fab fragment is a monovalent fragment consisting of the VL, VH, CL 
and CH1 domains; an F(ab')2 fragment is a bivalent fragment comprising two Fab
fragments linked by a disulfide bridge at the hinge region; an Fd fragment consists of the
VH and CH1 domains; an Fv fragment consists of the VL and VH domains of a single
arm of an antibody; and a dAb fragment consists of a VH domain. See, e.g., Ward et al, 
Nature 341: 544-546 (1989). 
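As an illustrative summary only, the domain compositions recited above for the Fab, Fd, Fv and dAb fragments can be tabulated in a small lookup. The dictionary and function names are hypothetical and introduced solely for this sketch; the F(ab')2 fragment is omitted from the domain table because the text describes it as two Fab fragments joined at the hinge rather than by a domain list.

```python
# Domain composition of antigen-binding fragments, as recited in the text above.
FRAGMENT_DOMAINS = {
    "Fab": ("VL", "VH", "CL", "CH1"),  # monovalent
    "Fd": ("VH", "CH1"),
    "Fv": ("VL", "VH"),                # domains of a single antibody arm
    "dAb": ("VH",),
}

# Valency where the text states it explicitly.
FRAGMENT_VALENCY = {"Fab": 1, "F(ab')2": 2}


def fragments_containing(domain):
    """Return the sorted names of fragments whose listed domains include `domain`."""
    return sorted(
        name for name, domains in FRAGMENT_DOMAINS.items() if domain in domains
    )
```

For example, `fragments_containing("CH1")` returns the Fab and Fd fragments, the only two in the table that include a constant heavy-chain domain.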

By "bind specifically" and "specific binding" is here intended the ability of the

antibody to bind to a first molecular species in preference to binding to other molecular 
species with which the antibody and first molecular species are admixed. An antibody is 
said specifically to "recognize" a first molecular species when it can bind specifically to 
that first molecular species. 

A single-chain antibody (scFv) is an antibody in which a VL and VH region are

paired to form a monovalent molecule via a synthetic linker that enables them to be made 
as a single protein chain. See, e.g., Bird et al, Science 242: 423-426 (1988); Huston et 
al, Proc. Natl. Acad. Sci. USA 85: 5879-5883 (1988). Diabodies are bivalent, bispecific
antibodies in which VH and VL domains are expressed on a single polypeptide chain, but 

using a linker that is too short to allow for pairing between the two domains on the same
chain, thereby forcing the domains to pair with complementary domains of another chain
and creating two antigen binding sites. See e.g., Holliger et al, Proc. Natl. Acad. Sci.
USA 90: 6444-6448 (1993); Poljak et al, Structure 2: 1121-1123 (1994). One or more
CDRs may be incorporated into a molecule either covalently or noncovalently to make it 

an immunoadhesin. An immunoadhesin may incorporate the CDR(s) as part of a larger
polypeptide chain, may covalently link the CDR(s) to another polypeptide chain, or may 
incorporate the CDR(s) noncovalently. The CDRs permit the immunoadhesin to 
specifically bind to a particular antigen of interest. A chimeric antibody is an antibody 
that contains one or more regions from one antibody and one or more regions from one 

or more other antibodies.

An antibody may have one or more binding sites. If there is more than one 
binding site, the binding sites may be identical to one another or may be different. For 
instance, a naturally-occurring immunoglobulin has two identical binding sites, a single- 
chain antibody or Fab fragment has one binding site, while a "bispecific" or 

"bifunctional" antibody has two different binding sites.

An "isolated antibody" is an antibody that (1) is not associated with naturally- 
associated components, including other naturally-associated antibodies, that accompany 
it in its native state, (2) is free of other proteins from the same species, (3) is expressed
by a cell from a different species, or (4) does not occur in nature. It is known that 
purified proteins, including purified antibodies, may be stabilized with non-naturally- 
associated components. The non-naturally-associated component may be a protein, such 
as albumin (e.g., BSA) or a chemical such as polyethylene glycol (PEG).

A "neutralizing antibody" or "an inhibitory antibody" is an antibody that inhibits 
the activity of a polypeptide or blocks the binding of a polypeptide to a ligand that 
normally binds to it. An "activating antibody" is an antibody that increases the activity 
of a polypeptide. 

The term "epitope" includes any protein determinant capable of specifically

binding to an immunoglobulin or T-cell receptor. Epitopic determinants usually consist 
of chemically active surface groupings of molecules such as amino acids or sugar side 
chains and usually have specific three-dimensional structural characteristics, as well as 
specific charge characteristics. An antibody is said to specifically bind an antigen when 

the dissociation constant is less than 1 µM, preferably less than 100 nM and most
preferably less than 10 nM. 
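The dissociation-constant thresholds stated above can be sketched as a simple classifier. The function name and the tier labels are assumptions chosen for this illustration; only the three cut-offs (1 µM, 100 nM and 10 nM) come from the text.

```python
def binding_tier(kd_molar: float) -> str:
    """Classify a dissociation constant (in mol/L) against the stated thresholds."""
    if kd_molar <= 0:
        raise ValueError("dissociation constant must be positive")
    if kd_molar < 10e-9:    # less than 10 nM
        return "most preferred"
    if kd_molar < 100e-9:   # less than 100 nM
        return "preferred"
    if kd_molar < 1e-6:     # less than 1 µM
        return "specific"
    return "not specific"
```

A lower dissociation constant indicates tighter binding, so the tests run from the strictest threshold to the loosest.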

The term "patient" as used herein includes human and veterinary subjects. 
Throughout this specification and claims, the word "comprise," or variations such 
as "comprises" or "comprising," will be understood to imply the inclusion of a stated 

integer or group of integers but not the exclusion of any other integer or group of
integers. 

The term "lung specific" refers to a nucleic acid molecule or polypeptide that is 
expressed predominantly in the lung as compared to other tissues in the body. In a 
preferred embodiment, a "lung specific" nucleic acid molecule or polypeptide is 

expressed at a level that is 5-fold higher than any other tissue in the body. In a more
preferred embodiment, the "lung specific" nucleic acid molecule or polypeptide is 
expressed at a level that is 10-fold higher than any other tissue in the body, more 
preferably at least 15-fold, 20-fold, 25-fold, 50-fold or 100-fold higher than any other 
tissue in the body. Nucleic acid molecule levels may be measured by nucleic acid 

hybridization, such as Northern blot hybridization, or quantitative PCR. Polypeptide
levels may be measured by any method known to accurately quantitate protein levels, 
such as Western blot analysis. 
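The fold-expression criterion above can be illustrated with a short calculation that compares the lung level against the highest level measured in any other tissue. The function names, the dictionary layout and the example values are hypothetical; the sketch also assumes that all levels are on a common, already-normalized scale and treats a ratio equal to the threshold as meeting it.

```python
def lung_fold_over_other_tissues(levels):
    """Ratio of lung expression to the highest expression in any other tissue.

    `levels` maps a tissue name to a normalized expression level and must
    contain the key "lung" plus at least one other tissue.
    """
    lung = levels["lung"]
    others = [value for tissue, value in levels.items() if tissue != "lung"]
    if not others:
        raise ValueError("at least one non-lung tissue is required")
    highest_other = max(others)
    if highest_other <= 0:
        # No detectable expression elsewhere: the ratio is unbounded.
        return float("inf") if lung > 0 else 0.0
    return lung / highest_other


def is_lung_specific(levels, min_fold=5.0):
    """True if lung expression is at least `min_fold` times any other tissue."""
    return lung_fold_over_other_tissues(levels) >= min_fold
```

With levels of 120 in lung, 10 in liver and 20 in kidney, the ratio is 6, which satisfies the 5-fold criterion but not the 10-fold criterion.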
Nucleic Acid Molecules, Regulatory Sequences, Vectors, Host Cells and Recombinant
Methods of Making Polypeptides 

Nucleic Acid Molecules 

One aspect of the invention provides isolated nucleic acid molecules that are

specific to the lung or to lung cells or tissue or that are derived from such nucleic acid 
molecules. These isolated lung specific nucleic acids (LSNAs) may comprise a cDNA, a 
genomic DNA, RNA, or a fragment of one of these nucleic acids, or may be a non- 
naturally-occurring nucleic acid molecule. In a preferred embodiment, the nucleic acid 

molecule encodes a polypeptide that is specific to lung, a lung-specific polypeptide
(LSP). In a more preferred embodiment, the nucleic acid molecule encodes a 
polypeptide that comprises an amino acid sequence of SEQ ID NO: 165 through 284. In
another highly preferred embodiment, the nucleic acid molecule comprises a nucleic acid
sequence of SEQ ID NO: 1 through 164.

An LSNA may be derived from a human or from another animal. In a preferred

embodiment, the LSNA is derived from a human or other mammal. In a more preferred 
embodiment, the LSNA is derived from a human or other primate. In an even more 
preferred embodiment, the LSNA is derived from a human. 

By "nucleic acid molecule" for purposes of the present invention, it is also meant 

to be inclusive of nucleic acid sequences that selectively hybridize to a nucleic acid
molecule encoding an LSNA or a complement thereof. The hybridizing nucleic acid 
molecule may or may not encode a polypeptide or may not encode an LSP. However, in 
a preferred embodiment, the hybridizing nucleic acid molecule encodes an LSP. In a 
more preferred embodiment, the invention provides a nucleic acid molecule that 

selectively hybridizes to a nucleic acid molecule that encodes a polypeptide comprising
an amino acid sequence of SEQ ID NO: 165 through 284. In an even more preferred 
embodiment, the invention provides a nucleic acid molecule that selectively hybridizes to 
a nucleic acid molecule comprising the nucleic acid sequence of SEQ ID NO: 1 through 
164. 

In a preferred embodiment, the nucleic acid molecule selectively hybridizes to a

nucleic acid molecule encoding an LSP under low stringency conditions. In a more 
preferred embodiment, the nucleic acid molecule selectively hybridizes to a nucleic acid 
molecule encoding an LSP under moderate stringency conditions. In a more preferred 
embodiment, the nucleic acid molecule selectively hybridizes to a nucleic acid molecule
encoding an LSP under high stringency conditions. In an even more preferred 
embodiment, the nucleic acid molecule hybridizes under low, moderate or high 
stringency conditions to a nucleic acid molecule encoding a polypeptide comprising an 
amino acid sequence of SEQ ID NO: 165 through 284. In a yet more preferred
embodiment, the nucleic acid molecule hybridizes under low, moderate or high 
stringency conditions to a nucleic acid molecule comprising a nucleic acid sequence 
selected from SEQ ID NO: 1 through 164. In a preferred embodiment of the invention, 
the hybridizing nucleic acid molecule may be used to express recombinantly a 

polypeptide of the invention.

By "nucleic acid molecule" as used herein it is also meant to be inclusive of 
sequences that exhibit substantial sequence similarity to a nucleic acid encoding an LSP
or a complement of the encoding nucleic acid molecule. In a preferred embodiment, the 
nucleic acid molecule exhibits substantial sequence similarity to a nucleic acid molecule 

encoding human LSP. In a more preferred embodiment, the nucleic acid molecule
exhibits substantial sequence similarity to a nucleic acid molecule encoding a 
polypeptide having an amino acid sequence of SEQ ID NO: 165 through 284. In a 
preferred embodiment, the similar nucleic acid molecule is one that has at least 60% 
sequence identity with a nucleic acid molecule encoding an LSP, such as a polypeptide 

having an amino acid sequence of SEQ ID NO: 165 through 284, more preferably at least
70%, even more preferably at least 80% and even more preferably at least 85%. In a 
more preferred embodiment, the similar nucleic acid molecule is one that has at least 
90% sequence identity with a nucleic acid molecule encoding an LSP, more preferably at 
least 95%, more preferably at least 97%, even more preferably at least 98%, and still 

more preferably at least 99%. In another highly preferred embodiment, the nucleic acid
molecule is one that has at least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% sequence identity 
with a nucleic acid molecule encoding an LSP. 

In another preferred embodiment, the nucleic acid molecule exhibits substantial 
sequence similarity to an LSNA or its complement. In a more preferred embodiment, the 

nucleic acid molecule exhibits substantial sequence similarity to a nucleic acid molecule
comprising a nucleic acid sequence of SEQ ID NO: 1 through 164. In a preferred 
embodiment, the nucleic acid molecule is one that has at least 60% sequence identity 
with an LSNA, such as one having a nucleic acid sequence of SEQ ID NO: 1 through
164, more preferably at least 70%, even more preferably at least 80% and even more 
preferably at least 85%. In a more preferred embodiment, the nucleic acid molecule is 
one that has at least 90% sequence identity with an LSNA, more preferably at least 95%, 
more preferably at least 97%, even more preferably at least 98%, and still more
preferably at least 99%. In another highly preferred embodiment, the nucleic acid 
molecule is one that has at least 99.5%, 99.6%, 99.7%, 99.8% or 99.9% sequence identity 
with an LSNA. 

A nucleic acid molecule that exhibits substantial sequence similarity may be one 

that exhibits sequence identity over its entire length to an LSNA or to a nucleic acid

molecule encoding an LSP, or may be one that is similar over only a part of its length. In 
this case, the part is at least 50 nucleotides of the LSNA or the nucleic acid molecule 
encoding an LSP, preferably at least 100 nucleotides, more preferably at least 150 or 200 
nucleotides, even more preferably at least 250 or 300 nucleotides, still more preferably at 

least 400 or 500 nucleotides.
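As a rough illustration of similarity over only a part of a sequence, the following sketch slides a fixed-length window along two nucleotide sequences that have already been aligned and reports the highest percent identity found in any window. The function name, the "-" gap symbol and the default window of 50 positions (taken from the minimum part length stated above) are assumptions for this example; gapped columns are simply counted as mismatches, and no alignment is performed.

```python
def best_window_identity(aligned_a, aligned_b, window=50):
    """Highest percent identity over any `window` consecutive aligned columns.

    Assumption: the inputs are equal-length, pre-aligned strings in which
    '-' marks a gap. A column matches only if both residues are equal and
    neither is a gap.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if len(aligned_a) < window:
        raise ValueError("alignment is shorter than the window")
    matches = [
        1 if x == y and x != "-" else 0
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
    ]
    current = sum(matches[:window])  # matches in the first window
    best = current
    for i in range(window, len(matches)):
        # Slide the window one column to the right.
        current += matches[i] - matches[i - window]
        best = max(best, current)
    return 100.0 * best / window
```

The result can then be compared against whichever identity threshold (for example 60%, 90% or 99%) applies to the embodiment in question.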

The substantially similar nucleic acid molecule may be a naturally-occurring one 
that is derived from another species, especially one derived from another primate, 
wherein the similar nucleic acid molecule encodes an amino acid sequence that exhibits 
significant sequence identity to that of SEQ ID NO: 165 through 284 or demonstrates 

significant sequence identity to the nucleotide sequence of SEQ ID NO: 1 through 164.
The similar nucleic acid molecule may also be a naturally-occurring nucleic acid 
molecule from a human, when the LSNA is a member of a gene family. The similar 
nucleic acid molecule may also be a naturally-occurring nucleic acid molecule derived 
from a non-primate, mammalian species, including without limitation, domesticated 

species, e.g., dog, cat, mouse, rat, rabbit, hamster, cow, horse and pig; and wild animals,
e.g., monkey, fox, lions, tigers, bears, giraffes, zebras, etc. The substantially similar 
nucleic acid molecule may also be a naturally-occurring nucleic acid molecule derived 
from a non-mammalian species, such as birds or reptiles. The naturally-occurring 
substantially similar nucleic acid molecule may be isolated directly from humans or other 

species. In another embodiment, the substantially similar nucleic acid molecule may be
one that is experimentally produced by random mutation of a nucleic acid molecule. In 
another embodiment, the substantially similar nucleic acid molecule may be one that is 
experimentally produced by directed mutation of an LSNA. Further, the substantially
similar nucleic acid molecule may or may not be an LSNA. However, in a preferred 
embodiment, the substantially similar nucleic acid molecule is an LSNA. 

By "nucleic acid molecule" it is also meant to be inclusive of allelic variants of an 
LSNA or a nucleic acid encoding an LSP. For instance, single nucleotide

polymorphisms (SNPs) occur frequently in eukaryotic genomes. In fact, more than 1.4
million SNPs have already been identified in the human genome. See International Human
Genome Sequencing Consortium, Nature 409: 860-921 (2001). Thus, the sequence
determined from one individual of a species may differ from other allelic forms present 

within the population. Additionally, small deletions and insertions, rather than single
nucleotide polymorphisms, are not uncommon in the general population, and often do not 
alter the function of the protein. Further, amino acid substitutions occur frequently 
among natural allelic variants, and often do not substantially change protein function. 
In a preferred embodiment, the nucleic acid molecule comprising an allelic 

variant is a variant of a gene, wherein the gene is transcribed into an mRNA that encodes
an LSP. In a more preferred embodiment, the gene is transcribed into an mRNA that 
encodes an LSP comprising an amino acid sequence of SEQ ID NO: 165 through 284. In 
another preferred embodiment, the allelic variant is a variant of a gene, wherein the gene 
is transcribed into an mRNA that is an LSNA. In a more preferred embodiment, the gene 

is transcribed into an mRNA that comprises the nucleic acid sequence of SEQ ID NO: 1
through 164. In a preferred embodiment, the allelic variant is a naturally-occurring 
allelic variant in the species of interest. In a more preferred embodiment, the species of 
interest is human. 

By "nucleic acid molecule" it is also meant to be inclusive of a part of a nucleic 
acid sequence of the instant invention. The part may or may not encode a polypeptide,
and may or may not encode a polypeptide that is an LSP. However, in a preferred 
embodiment, the part encodes an LSP. In one aspect, the invention comprises a part of 
an LSNA. In a second aspect, the invention comprises a part of a nucleic acid molecule 
that hybridizes or exhibits substantial sequence similarity to an LSNA. In a third aspect, 
the invention comprises a part of a nucleic acid molecule that is an allelic variant of an
LSNA. In a fourth aspect, the invention comprises a part of a nucleic acid molecule that 
encodes an LSP. A part comprises at least 10 nucleotides, more preferably at least 15, 
17, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 250, 300, 350, 400 or 500
nucleotides. The maximum size of a nucleic acid part is one nucleotide shorter than the 
sequence of the nucleic acid molecule encoding the full-length protein. 
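The size limits for a nucleic acid part stated above (at least 10 nucleotides, and at most one nucleotide shorter than the full-length coding sequence) can be sketched as follows. The function names are hypothetical, and the short sequence in the usage note is an arbitrary example rather than a sequence of the invention.

```python
def is_valid_part_length(part_len, full_len, min_len=10):
    """True if a part length lies between `min_len` and full length minus one."""
    return min_len <= part_len <= full_len - 1


def iter_parts(seq, length):
    """Yield every contiguous part of `seq` having the given length."""
    if not is_valid_part_length(length, len(seq)):
        raise ValueError("part length outside the permitted range")
    for start in range(len(seq) - length + 1):
        yield seq[start:start + length]
```

For a 12-nucleotide example sequence, parts of length 10 and 11 are permitted, while a part of length 12 is excluded because it would equal the full length.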

By "nucleic acid molecule" it is also meant to be inclusive of a sequence
encoding a fusion protein, a homologous protein, a polypeptide fragment, a mutein or a
polypeptide analog, as described below. 

Nucleotide sequences of the instantly-described nucleic acids were determined by 
sequencing a DNA molecule that had resulted, directly or indirectly, from at least one 
enzymatic polymerization reaction (e.g., reverse transcription and/or polymerase chain 

reaction) using an automated sequencer (such as the MegaBACE™ 1000, Molecular
Dynamics, Sunnyvale, CA, USA). Further, all amino acid sequences of the polypeptides 
of the present invention were predicted by translation from the nucleic acid sequences so 
determined, unless otherwise specified. 

In a preferred embodiment of the invention, the nucleic acid molecule contains 

modifications of the native nucleic acid molecule. These modifications include
nonnative internucleoside bonds, post-synthetic modifications or altered nucleotide
analogues. One having ordinary skill in the art would recognize that the type of 
modification that can be made will depend upon the intended use of the nucleic acid 
molecule. For instance, when the nucleic acid molecule is used as a hybridization probe, 

the range of such modifications will be limited to those that permit sequence-
discriminating base pairing of the resulting nucleic acid. When used to direct expression
of RNA or protein in vitro or in vivo, the range of such modifications will be limited to 
those that permit the nucleic acid to function properly as a polymerization substrate. 
When the isolated nucleic acid is used as a therapeutic agent, the modifications will be 

limited to those that do not confer toxicity upon the isolated nucleic acid.

In a preferred embodiment, isolated nucleic acid molecules can include nucleotide 
analogues that incorporate labels that are directly detectable, such as radiolabels or 
fluorophores, or nucleotide analogues that incorporate labels that can be visualized in a 
subsequent reaction, such as biotin or various haptens. In a more preferred embodiment, 

the labeled nucleic acid molecule may be used as a hybridization probe.
Common radiolabeled analogues include those labeled with 33P, 32P, and 35S, such
as α-32P-dATP, α-32P-dCTP, α-32P-dGTP, α-32P-dTTP, α-32P-3'dATP, α-32P-ATP,
α-32P-CTP, α-32P-GTP, α-32P-UTP, α-35S-dATP, α-35S-GTP, α-33P-dATP, and the like.
Commercially available fluorescent nucleotide analogues readily incorporated 

into the nucleic acids of the present invention include Cy3-dCTP, Cy3-dUTP, Cy5-
dCTP, Cy3-dUTP (Amersham Pharmacia Biotech, Piscataway, New Jersey, USA),
fluorescein-12-dUTP, tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade
Blue®-7-dUTP, BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, BODIPY®
TR-14-dUTP, Rhodamine Green™-5-dUTP, Oregon Green® 488-5-dUTP, Texas
Red®-12-dUTP, BODIPY® 630/650-14-dUTP, BODIPY® 650/665-14-dUTP, Alexa
Fluor® 488-5-dUTP, Alexa Fluor® 532-5-dUTP, Alexa Fluor® 568-5-dUTP, Alexa
Fluor® 594-5-dUTP, Alexa Fluor® 546-14-dUTP, fluorescein-12-UTP,
tetramethylrhodamine-6-UTP, Texas Red®-5-UTP, Cascade Blue®-7-UTP, BODIPY®
FL-14-UTP, BODIPY® TMR-14-UTP, BODIPY® TR-14-UTP, Rhodamine
Green™-5-UTP, Alexa Fluor® 488-5-UTP, Alexa Fluor® 546-14-UTP (Molecular
Probes, Inc., Eugene, OR, USA). One may also custom synthesize nucleotides having
other fluorophores. See Henegariu et al, Nature Biotechnol. 18: 345-348 (2000), the
disclosure of which is incorporated herein by reference in its entirety. 

Haptens that are commonly conjugated to nucleotides for subsequent labeling 

include biotin (biotin-11-dUTP, Molecular Probes, Inc., Eugene, OR, USA;
biotin-21-UTP, biotin-21-dUTP, Clontech Laboratories, Inc., Palo Alto, CA, USA),
digoxigenin (DIG-11-dUTP, alkali labile, DIG-11-UTP, Roche Diagnostics Corp.,
Indianapolis, IN, USA), and dinitrophenyl (dinitrophenyl-11-dUTP, Molecular Probes,
Inc., Eugene, OR, USA). 

Nucleic acid molecules can be labeled by incorporation of labeled nucleotide

analogues into the nucleic acid. Such analogues can be incorporated by enzymatic 
polymerization, such as by nick translation, random priming, polymerase chain reaction 
(PCR), terminal transferase tailing, and end-filling of overhangs, for DNA molecules, 
and in vitro transcription driven, e.g., from phage promoters, such as T7, T3, and SP6, for 

RNA molecules. Commercial kits are readily available for each such labeling approach.
Analogues can also be incorporated during automated solid phase chemical synthesis. 
Labels can also be incorporated after nucleic acid synthesis, with the 5' phosphate and 3' 
hydroxyl providing convenient sites for post-synthetic covalent attachment of detectable
labels. 

Other post-synthetic approaches also permit internal labeling of nucleic acids. 
For example, fluorophores can be attached using a cisplatin reagent that reacts with the 
N7 of guanine residues (and, to a lesser extent, adenine bases) in DNA, RNA, and PNA
to provide a stable coordination complex between the nucleic acid and fluorophore label 
(Universal Linkage System) (available from Molecular Probes, Inc., Eugene, OR, USA 
and Amersham Pharmacia Biotech, Piscataway, NJ, USA); see Alers et al, Genes, 
Chromosomes & Cancer 25: 301-305 (1999); Jelsma et al, J. NIH Res. 5: 82 (1994);
Van Belkum et al, BioTechniques 16: 148-153 (1994), incorporated herein by reference.
As another example, nucleic acids can be labeled using a disulfide-containing linker 
(FastTag™ Reagent, Vector Laboratories, Inc., Burlingame, CA, USA) that is photo- or 
thermally-coupled to the target nucleic acid using aryl azide chemistry; after reduction, a 
free thiol is available for coupling to a hapten, fluorophore, sugar, affinity ligand, or 

other marker.

One or more independent or interacting labels can be incorporated into the 
nucleic acid molecules of the present invention. For example, both a fluorophore and a 
moiety that in proximity thereto acts to quench fluorescence can be included to report 
specific hybridization through release of fluorescence quenching or to report 

exonucleotidic excision. See, e.g., Tyagi et al, Nature Biotechnol. 14: 303-308 (1996);
Tyagi et al, Nature Biotechnol. 16: 49-53 (1998); Sokol et al, Proc. Natl. Acad. Sci.
USA 95: 11538-11543 (1998); Kostrikis et al, Science 279: 1228-1229 (1998); Marras
et al, Genet. Anal. 14: 151-156 (1999); U. S. Patent 5,846,726; 5,925,517; 5,925,517;
5,723,591 and 5,538,848; Holland et al, Proc. Natl. Acad. Sci. USA 88: 7276-7280
(1991); Heid et al, Genome Res. 6(10): 986-94 (1996); Kuimelis et al, Nucleic Acids
Symp. Ser. (37): 255-6 (1997); the disclosures of which are incorporated herein by 
reference in their entireties. 

Nucleic acid molecules of the invention may be modified by altering one or more
native phosphodiester internucleoside bonds to more nuclease-resistant internucleoside
bonds. See Hartmann et al. (eds.), Manual of Antisense Methodology: Perspectives in
Antisense Science, Kluwer Law International (1999); Stein et al. (eds.), Applied
Antisense Oligonucleotide Technology, Wiley-Liss (1998); Chadwick et al. (eds.),
Oligonucleotides as Therapeutic Agents - Symposium No. 209, John Wiley & Son Ltd
(1997); the disclosures of which are incorporated herein by reference in their entireties.
Such altered internucleoside bonds are often desired for antisense techniques or for
targeted gene correction. See Gamper et al., Nucl. Acids Res. 28(21): 4332-4339 (2000),
the disclosure of which is incorporated herein by reference in its entirety.

Modified oligonucleotide backbones include, without limitation,
phosphorothioates, chiral phosphorothioates, phosphorodithioates, phosphotriesters,
aminoalkylphosphotriesters, methyl and other alkyl phosphonates including 3'-alkylene
phosphonates and chiral phosphonates, phosphinates, phosphoramidates including
3'-amino phosphoramidate and aminoalkylphosphoramidates, thionophosphoramidates,
thionoalkylphosphonates, thionoalkylphosphotriesters, and boranophosphates having
normal 3'-5' linkages, 2'-5' linked analogs of these, and those having inverted polarity
wherein the adjacent pairs of nucleoside units are linked 3'-5' to 5'-3' or 2'-5' to 5'-2'.
Representative United States patents that teach the preparation of the above
phosphorus-containing linkages include, but are not limited to, U.S. Patents 3,687,808;
4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 5,276,019;
5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 5,455,233;
5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253;
5,571,799; 5,587,361; and 5,625,050, the disclosures of which are incorporated herein by
reference in their entireties. In a preferred embodiment, the modified internucleoside
linkages may be used for antisense techniques.

Other modified oligonucleotide backbones do not include a phosphorus atom, but
have backbones that are formed by short chain alkyl or cycloalkyl internucleoside
linkages, mixed heteroatom and alkyl or cycloalkyl internucleoside linkages, or one or
more short chain heteroatomic or heterocyclic internucleoside linkages. These include
those having morpholino linkages (formed in part from the sugar portion of a
nucleoside); siloxane backbones; sulfide, sulfoxide and sulfone backbones; formacetyl
and thioformacetyl backbones; methylene formacetyl and thioformacetyl backbones;
alkene containing backbones; sulfamate backbones; methyleneimino and
methylenehydrazino backbones; sulfonate and sulfonamide backbones; amide backbones;
and others having mixed N, O, S and CH2 component parts. Representative U.S. patents
that teach the preparation of the above backbones include, but are not limited to, U.S.
Patents 5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 5,264,562;
5,264,564; 5,405,938; 5,434,257; 5,466,677; 5,470,967; 5,489,677; 5,541,307;
5,561,225; 5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 5,610,289;
5,618,704; 5,623,070; 5,663,312; 5,633,360; 5,677,437 and 5,677,439; the disclosures of
which are incorporated herein by reference in their entireties.

In other preferred oligonucleotide mimetics, both the sugar and the
internucleoside linkage are replaced with novel groups, such as peptide nucleic acids
(PNA). In PNA compounds, the phosphodiester backbone of the nucleic acid is replaced
with an amide-containing backbone, in particular by repeating N-(2-aminoethyl) glycine
units linked by amide bonds. Nucleobases are bound directly or indirectly to aza
nitrogen atoms of the amide portion of the backbone, typically by methylene carbonyl
linkages. PNA can be synthesized using a modified peptide synthesis protocol. PNA
oligomers can be synthesized by both Fmoc and tBoc methods. Representative U.S.
patents that teach the preparation of PNA compounds include, but are not limited to, U.S.
Patents 5,539,082; 5,714,331; and 5,719,262, each of which is herein incorporated by
reference. Automated PNA synthesis is readily achievable on commercial synthesizers
(see, e.g., "PNA User's Guide," Rev. 2, February 1998, Perseptive Biosystems Part No.
60138, Applied Biosystems, Inc., Foster City, CA).

PNA molecules are advantageous for a number of reasons. First, because the
PNA backbone is uncharged, PNA/DNA and PNA/RNA duplexes have a higher thermal
stability than is found in DNA/DNA and DNA/RNA duplexes. The Tm of a PNA/DNA
or PNA/RNA duplex is generally 1°C higher per base pair than the Tm of the
corresponding DNA/DNA or DNA/RNA duplex (in 100 mM NaCl). Second, PNA
molecules can also form stable PNA/DNA complexes at low ionic strength, under
conditions in which DNA/DNA duplex formation does not occur. Third, PNA also
demonstrates greater specificity in binding to complementary DNA because a PNA/DNA
mismatch is more destabilizing than a DNA/DNA mismatch. A single mismatch in a
mixed PNA/DNA 15-mer lowers the Tm by 8-20°C (15°C on average). In the corresponding
DNA/DNA duplexes, a single mismatch lowers the Tm by 4-16°C (11°C on average).
Because PNA probes can be significantly shorter than DNA probes, their specificity is
greater. Fourth, PNA oligomers are resistant to degradation by enzymes, and the lifetime
of these compounds is extended both in vivo and in vitro because nucleases and proteases
do not recognize the PNA polyamide backbone with nucleobase sidechains. See, e.g.,
Ray et al., FASEB J. 14(9): 1041-60 (2000); Nielsen et al., Pharmacol. Toxicol. 86(1):
3-7 (2000); Larsen et al., Biochim. Biophys. Acta. 1489(1): 159-66 (1999); Nielsen, Curr.
Opin. Struct. Biol. 9(3): 353-7 (1999), and Nielsen, Curr. Opin. Biotechnol. 10(1): 71-5
(1999), the disclosures of which are incorporated herein by reference in their entireties.
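
By way of illustration only, the approximate figures quoted above can be applied as a
back-of-the-envelope estimate. The following Python sketch assumes that the Tm of the
corresponding DNA/DNA duplex is already known from measurement or from another
estimate; the function names and the example Tm of 50°C are hypothetical and are not
taken from any cited reference.

```python
# Rule-of-thumb figures taken from the preceding paragraph (100 mM NaCl).
PNA_GAIN_PER_BP_C = 1.0          # approximate Tm gain per base pair for a PNA duplex
AVG_MISMATCH_DROP_PNA_C = 15.0   # average drop, single mismatch, PNA/DNA 15-mer
AVG_MISMATCH_DROP_DNA_C = 11.0   # average drop, single mismatch, DNA/DNA 15-mer


def estimate_pna_tm(dna_tm_c: float, length_bp: int) -> float:
    """Estimate a PNA/DNA Tm from the Tm of the corresponding DNA/DNA duplex."""
    if length_bp <= 0:
        raise ValueError("length_bp must be positive")
    return dna_tm_c + PNA_GAIN_PER_BP_C * length_bp


def estimate_mismatch_tm(perfect_tm_c: float, is_pna: bool) -> float:
    """Apply the average single-mismatch penalty quoted for 15-mers."""
    drop = AVG_MISMATCH_DROP_PNA_C if is_pna else AVG_MISMATCH_DROP_DNA_C
    return perfect_tm_c - drop


if __name__ == "__main__":
    dna_tm = 50.0                               # hypothetical DNA/DNA 15-mer Tm
    pna_tm = estimate_pna_tm(dna_tm, 15)
    print("PNA/DNA perfect match:", pna_tm)
    print("PNA/DNA one mismatch: ", estimate_mismatch_tm(pna_tm, is_pna=True))
    print("DNA/DNA one mismatch: ", estimate_mismatch_tm(dna_tm, is_pna=False))
```

The estimate is only as good as the averages it applies; actual melting temperatures
depend on sequence, mismatch position and buffer conditions.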

Modifications of nucleic acid molecules relative to their native structure may extend
throughout the length of the nucleic acid molecule or can be localized to discrete portions
thereof. As an example of the latter, chimeric nucleic acids can be synthesized that have
discrete DNA and RNA domains and that can be used for targeted gene repair and
modified PCR reactions, as further described in U.S. Patents 5,760,012 and 5,731,181;
Misra et al., Biochem. 37: 1917-1925 (1998); and Finn et al., Nucl. Acids Res. 24:
3357-3363 (1996), the disclosures of which are incorporated herein by reference in their
entireties.

Unless otherwise specified, nucleic acids of the present invention can include any
topological conformation appropriate to the desired use; the term thus explicitly
comprehends, among others, single-stranded, double-stranded, triplexed, quadruplexed,
partially double-stranded, partially-triplexed, partially-quadruplexed, branched,
hairpinned, circular, and padlocked conformations. Padlock conformations and their
utilities are further described in Banér et al., Curr. Opin. Biotechnol. 12: 11-15 (2001);
Escude et al., Proc. Natl. Acad. Sci. USA 14: 96(19): 10603-7 (1999); Nilsson et al.,
Science 265(5181): 2085-8 (1994), the disclosures of which are incorporated herein by
reference in their entireties. Triplex and quadruplex conformations, and their utilities, are
reviewed in Praseuth et al., Biochim. Biophys. Acta. 1489(1): 181-206 (1999); Fox, Curr.
Med. Chem. 7(1): 17-37 (2000); Kochetkova et al., Methods Mol. Biol. 130: 189-201
(2000); Chan et al., J. Mol. Med. 75(4): 267-82 (1997), the disclosures of which are
incorporated herein by reference in their entireties.

Methods for Using Nucleic Acid Molecules as Probes and Primers 

The isolated nucleic acid molecules of the present invention can be used as
hybridization probes to detect, characterize, and quantify hybridizing nucleic acids in,
and isolate hybridizing nucleic acids from, both genomic and transcript-derived nucleic
acid samples. When free in solution, such probes are typically, but not invariably,
detectably labeled; bound to a substrate, as in a microarray, such probes are typically, but
not invariably, unlabeled.

In one embodiment, the isolated nucleic acids of the present invention can be
used as probes to detect and characterize gross alterations in the gene of an LSNA, such
as deletions, insertions, translocations, and duplications of the LSNA genomic locus
through fluorescence in situ hybridization (FISH) to chromosome spreads. See, e.g.,
Andreeff et al. (eds.), Introduction to Fluorescence In Situ Hybridization: Principles and
Clinical Applications, John Wiley & Sons (1999), the disclosure of which is incorporated
herein by reference in its entirety. The isolated nucleic acids of the present invention can
be used as probes to assess smaller genomic alterations using, e.g., Southern blot
detection of restriction fragment length polymorphisms. The isolated nucleic acid
molecules of the present invention can be used as probes to isolate genomic clones that
include the nucleic acid molecules of the present invention, which thereafter can be
restriction mapped and sequenced to identify deletions, insertions, translocations, and
substitutions (single nucleotide polymorphisms, SNPs) at the sequence level.
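
As a purely illustrative sketch of why a sequence change can appear as a restriction
fragment length polymorphism, the following Python code digests a linear sequence in
silico and reports fragment lengths. It assumes the EcoRI recognition site GAATTC, cut
after the first G; the function name and the short test sequences are hypothetical and
are not sequences of the invention.

```python
from typing import List


def digest_lengths(seq: str, site: str = "GAATTC", cut_offset: int = 1) -> List[int]:
    """Return fragment lengths after cutting a linear sequence at every `site`.

    `cut_offset` is the position within the site at which the top strand is cut
    (1 for EcoRI, which cuts G^AATTC).
    """
    seq = seq.upper()
    cuts = []
    i = seq.find(site)
    while i != -1:
        cuts.append(i + cut_offset)
        i = seq.find(site, i + 1)
    bounds = [0] + cuts + [len(seq)]
    return [end - start for start, end in zip(bounds, bounds[1:])]


if __name__ == "__main__":
    # Hypothetical alleles: the second carries a single substitution that
    # destroys the downstream EcoRI site.
    allele_1 = "AAAAGAATTCTTTTTTGAATTCCC"
    allele_2 = "AAAAGAATTCTTTTTTGAGTTCCC"
    print(digest_lengths(allele_1))
    print(digest_lengths(allele_2))
```

A probe hybridizing to the region would detect differently sized fragments for the two
hypothetical alleles, which is the signal read from a Southern blot.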

In another embodiment, the isolated nucleic acid molecules of the present
invention can be used as probes to detect, characterize, and quantify LSNA in, and isolate
LSNA from, transcript-derived nucleic acid samples. In one aspect, the isolated nucleic
acid molecules of the present invention can be used as hybridization probes to detect,
characterize by length, and quantify mRNA by Northern blot of total or poly-A+-selected
RNA samples. In another aspect, the isolated nucleic acid molecules of the present
invention can be used as hybridization probes to detect, characterize by location, and
quantify mRNA by in situ hybridization to tissue sections. See, e.g., Schwarzacher et
al., In Situ Hybridization, Springer-Verlag New York (2000), the disclosure of which is
incorporated herein by reference in its entirety. In another preferred embodiment, the
isolated nucleic acid molecules of the present invention can be used as hybridization
probes to measure the representation of clones in a cDNA library or to isolate hybridizing
nucleic acid molecules from cDNA libraries, permitting sequence level
characterization of mRNAs that hybridize to LSNAs, including, without limitation,
identification of deletions, insertions, substitutions, truncations, alternatively spliced
forms and single nucleotide polymorphisms. In yet another preferred embodiment, the
nucleic acid molecules of the instant invention may be used in microarrays.

All of the aforementioned probe techniques are well within the skill in the art, and
are described at greater length in standard texts such as Sambrook (2001), supra;
Ausubel (1999), supra; and Walker et al. (eds.), The Nucleic Acids Protocols Handbook,
Humana Press (2000), the disclosures of which are incorporated herein by reference in
their entireties.

Thus, in one embodiment, a nucleic acid molecule of the invention may be used
as a probe or primer to identify or amplify a second nucleic acid molecule that selectively
hybridizes to the nucleic acid molecule of the invention. In a preferred embodiment, the
probe or primer is derived from a nucleic acid molecule encoding an LSP. In a more
preferred embodiment, the probe or primer is derived from a nucleic acid molecule
encoding a polypeptide having an amino acid sequence of SEQ ID NO: 165 through 284.
In another preferred embodiment, the probe or primer is derived from an LSNA. In a
more preferred embodiment, the probe or primer is derived from a nucleic acid molecule
having a nucleotide sequence of SEQ ID NO: 1 through 164.

In general, a probe or primer is at least 10 nucleotides in length, more preferably
at least 12, more preferably at least 14 and even more preferably at least 16 or 17
nucleotides in length. In an even more preferred embodiment, the probe or primer is at
least 18 nucleotides in length, even more preferably at least 20 nucleotides and even
more preferably at least 22 nucleotides in length. Primers and probes may also be longer.
For instance, a probe or primer may be 25 nucleotides in length, or may be 30,
40 or 50 nucleotides in length. Methods of performing nucleic acid hybridization using
oligonucleotide probes are well-known in the art. See, e.g., Sambrook et al., 1989, supra,
Chapter 11 and pp. 11.31-11.32 and 11.40-11.44, which describe radiolabeling of short
probes, and pp. 11.45-11.53, which describe hybridization conditions for oligonucleotide
probes, including specific conditions for probe hybridization (pp. 11.50-11.51).
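
For illustration, candidate oligonucleotide probes of a chosen length can be enumerated
from a target sequence as sketched below in Python. The sketch assumes plain A/C/G/T
sequences and applies only the minimum length of 10 nucleotides stated above; the
function names and the example target are hypothetical, and no melting temperature,
specificity or secondary structure screening is attempted.

```python
from typing import List

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def candidate_probes(target: str, length: int = 18, min_length: int = 10) -> List[str]:
    """List every oligonucleotide of `length` nt that is complementary to `target`.

    Each probe is written 5' to 3' and is the reverse complement of one
    window of the target sequence.
    """
    if length < min_length:
        raise ValueError("probe length is below the stated minimum")
    target = target.upper()
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


if __name__ == "__main__":
    # Hypothetical 12-nt target; three 10-mer probes tile across it.
    for probe in candidate_probes("ATGCATGCATGC", length=10):
        print(probe)
```

In practice the enumerated candidates would be filtered further, for example for
uniqueness against the sample being probed.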

Methods of performing primer-directed amplification are also well-known in the
art. Methods for performing the polymerase chain reaction (PCR) are compiled, inter
alia, in McPherson, PCR Basics: From Background to Bench, Springer Verlag (2000);
Innis et al. (eds.), PCR Applications: Protocols for Functional Genomics, Academic
Press (1999); Gelfand et al. (eds.), PCR Strategies, Academic Press (1998); Newton et
al., PCR, Springer-Verlag New York (1997); Burke (ed.), PCR: Essential Techniques,
John Wiley & Son Ltd (1996); White (ed.), PCR Cloning Protocols: From Molecular
Cloning to Genetic Engineering, Vol. 67, Humana Press (1996); McPherson et al. (eds.),
PCR 2: A Practical Approach, Oxford University Press, Inc. (1995); the disclosures of
which are incorporated herein by reference in their entireties. Methods for performing
RT-PCR are collected, e.g., in Siebert et al. (eds.), Gene Cloning and Analysis by
RT-PCR, Eaton Publishing Company/BioTechniques Books Division, 1998; Siebert
(ed.), PCR Technique: RT-PCR, Eaton Publishing Company/BioTechniques Books
(1995); the disclosures of which are incorporated herein by reference in their entireties.

PCR and hybridization methods may be used to identify and/or isolate allelic
variants, homologous nucleic acid molecules and fragments of the nucleic acid molecules
of the invention. PCR and hybridization methods may also be used to identify, amplify
and/or isolate nucleic acid molecules that encode homologous proteins, analogs, fusion
proteins or muteins of the invention. The nucleic acid primers of the present invention
can be used to prime amplification of nucleic acid molecules of the invention, using
transcript-derived or genomic DNA as template.
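
As a simplified illustration of primer-directed amplification, the following Python
sketch predicts the product that a primer pair would delimit on a template. It assumes
exact matching of both primers and a single linear template; the function names and the
example sequences are hypothetical, and mismatch tolerance, annealing temperature and
multiple priming sites are deliberately ignored.

```python
from typing import Optional

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def predict_amplicon(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the top-strand amplicon delimited by two exact-match primers.

    `forward` matches the top strand; `reverse` anneals to the top strand, so
    its reverse complement is searched for downstream of the forward primer.
    Returns None if either primer site is absent.
    """
    template = template.upper()
    forward = forward.upper()
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


if __name__ == "__main__":
    # Hypothetical template and primers (shortened for readability).
    template = "GGATCCATGAAACCCGGGTTTTAAGAATTC"
    print(predict_amplicon(template, "ATGAAA", "TTAAAA"))
```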

The nucleic acid primers of the present invention can also be used, for example,
to prime single base extension (SBE) for SNP detection (see, e.g., U.S. Patent 6,004,744,
the disclosure of which is incorporated herein by reference in its entirety).
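
The principle of single base extension can be sketched as follows. This Python example
assumes a primer whose sequence matches the strand shown immediately 5' of the
polymorphic position, so that the single nucleotide added by the polymerase is identical
to the base at that position on the strand shown; the function name and the two example
alleles are hypothetical and do not reproduce the method of the cited patent.

```python
def sbe_incorporated_base(strand: str, primer: str) -> str:
    """Return the base that would be added to the 3' end of `primer`.

    `strand` is written 5' to 3' and has the same sense as the primer; the
    primer anneals to the complementary strand and is extended by one base.
    """
    strand = strand.upper()
    primer = primer.upper()
    index = strand.find(primer)
    if index == -1:
        raise ValueError("primer does not match the strand")
    position = index + len(primer)
    if position >= len(strand):
        raise ValueError("no base follows the primer on this strand")
    return strand[position]


if __name__ == "__main__":
    primer = "ACGTTGCAAG"             # hypothetical 10-nt primer
    allele_g = "ACGTTGCAAGGCTA"       # hypothetical allele with G at the SNP
    allele_t = "ACGTTGCAAGTCTA"       # hypothetical allele with T at the SNP
    print(sbe_incorporated_base(allele_g, primer))
    print(sbe_incorporated_base(allele_t, primer))
```

With differently labeled terminator nucleotides, the identity of the added base
distinguishes the two hypothetical alleles.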

Isothermal amplification approaches, such as rolling circle amplification, are also
now well-described. See, e.g., Schweitzer et al., Curr. Opin. Biotechnol. 12(1): 21-7
(2001); U.S. Patents 5,854,033 and 5,714,320; and international patent publications
WO 97/19193 and WO 00/15779, the disclosures of which are incorporated herein by
reference in their entireties. Rolling circle amplification can be combined with other
techniques to facilitate SNP detection. See, e.g., Lizardi et al., Nature Genet. 19(3):
225-32 (1998).

Nucleic acid molecules of the present invention may be bound to a substrate
either covalently or noncovalently. The substrate can be porous or solid, planar or
non-planar, unitary or distributed. The bound nucleic acid molecules may be used as
hybridization probes, and may be labeled or unlabeled. In a preferred embodiment, the
bound nucleic acid molecules are unlabeled.

In one embodiment, the nucleic acid molecule of the present invention is bound to
a porous substrate, e.g., a membrane, typically comprising nitrocellulose, nylon, or
positively-charged derivatized nylon. The nucleic acid molecule of the present invention
can be used to detect a hybridizing nucleic acid molecule that is present within a labeled
nucleic acid sample, e.g., a sample of transcript-derived nucleic acids. In another
embodiment, the nucleic acid molecule is bound to a solid substrate, including, without
limitation, glass, amorphous silicon, crystalline silicon or plastics. Examples of plastics
include, without limitation, polymethylacrylic, polyethylene, polypropylene,
polyacrylate, polymethylmethacrylate, polyvinylchloride, polytetrafluoroethylene,
polystyrene, polycarbonate, polyacetal, polysulfone, cellulose acetate, cellulose nitrate,
nitrocellulose, or mixtures thereof. The solid substrate may be any shape, including
rectangular, disk-like and spherical. In a preferred embodiment, the solid substrate is a
microscope slide or slide-shaped substrate.

The nucleic acid molecule of the present invention can be attached covalently to a
surface of the support substrate or applied to a derivatized surface in a chaotropic agent
that facilitates denaturation and adherence by presumed noncovalent interactions, or
some combination thereof. The nucleic acid molecule of the present invention can be
bound to a substrate to which a plurality of other nucleic acids are concurrently bound,
hybridization to each of the plurality of bound nucleic acids being separately detectable.
At low density, e.g., on a porous membrane, these substrate-bound collections are
typically denominated macroarrays; at higher density, typically on a solid support, such
as glass, these substrate-bound collections of plural nucleic acids are colloquially termed
microarrays. As used herein, the term microarray includes arrays of all densities. It is,
therefore, another aspect of the invention to provide microarrays that include the nucleic
acids of the present invention.

Expression Vectors, Host Cells and Recombinant Methods of Producing Polypeptides 
Another aspect of the present invention relates to vectors that comprise one or
more of the isolated nucleic acid molecules of the present invention, and host cells in
which such vectors have been introduced.

The vectors can be used, inter alia, for propagating the nucleic acids of the
present invention in host cells (cloning vectors), for shuttling the nucleic acids of the
present invention between host cells derived from disparate organisms (shuttle vectors),
for inserting the nucleic acids of the present invention into host cell chromosomes
(insertion vectors), for expressing sense or antisense RNA transcripts of the nucleic acids
of the present invention in vitro or within a host cell, and for expressing polypeptides
encoded by the nucleic acids of the present invention, alone or as fusions to heterologous
polypeptides (expression vectors). Vectors of the present invention will often be suitable
for several such uses.

Vectors are by now well-known in the art, and are described, inter alia, in Jones
et al. (eds.), Vectors: Cloning Applications: Essential Techniques (Essential Techniques
Series), John Wiley & Son Ltd. (1998); Jones et al. (eds.), Vectors: Expression Systems:
Essential Techniques (Essential Techniques Series), John Wiley & Son Ltd. (1998);
Gacesa et al., Vectors: Essential Data, John Wiley & Sons Ltd. (1995); Cid-Arregui
(eds.), Viral Vectors: Basic Science and Gene Therapy, Eaton Publishing Co. (2000);
Sambrook (2001), supra; Ausubel (1999), supra; the disclosures of which are
incorporated herein by reference in their entireties. Furthermore, an enormous variety of
vectors are available commercially. Use of existing vectors and modifications thereof
being well within the skill in the art, only basic features need be described here.

Nucleic acid sequences may be expressed by operatively linking them to an
expression control sequence in an appropriate expression vector and employing that
expression vector to transform an appropriate unicellular host. Expression control
sequences are sequences which control the transcription, post-transcriptional events and
translation of nucleic acid sequences. Such operative linking of a nucleic acid sequence of
this invention to an expression control sequence, of course, includes, if not already part
of the nucleic acid sequence, the provision of a translation initiation codon, ATG or
GTG, in the correct reading frame upstream of the nucleic acid sequence.
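
The requirement for an in-frame initiation codon can be illustrated with a short Python
sketch. It assumes that the coding sequence is supplied as whole codons; the function
name and example sequences are hypothetical. Because a complete codon is prepended,
the downstream reading frame is left unchanged.

```python
START_CODONS = ("ATG", "GTG")


def ensure_start_codon(coding_seq: str, start: str = "ATG") -> str:
    """Return `coding_seq` with an in-frame initiation codon at its 5' end.

    A start codon is prepended only if the sequence does not already begin
    with ATG or GTG.
    """
    seq = coding_seq.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence is not a whole number of codons")
    if start not in START_CODONS:
        raise ValueError("start codon must be ATG or GTG")
    if seq[:3] in START_CODONS:
        return seq
    return start + seq


if __name__ == "__main__":
    print(ensure_start_codon("AAACCC"))          # start codon added
    print(ensure_start_codon("ATGAAACCC"))       # already present, unchanged
```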

A wide variety of host/expression vector combinations may be employed in
expressing the nucleic acid sequences of this invention. Useful expression vectors, for
example, may consist of segments of chromosomal, non-chromosomal and synthetic
nucleic acid sequences.

In one embodiment, prokaryotic cells may be used with an appropriate vector.
Prokaryotic host cells are often used for cloning and expression. In a preferred
embodiment, prokaryotic host cells include E. coli, Pseudomonas, Bacillus and
Streptomyces. In a preferred embodiment, bacterial host cells are used to express the
nucleic acid molecules of the instant invention. Useful expression vectors for bacterial
hosts include bacterial plasmids, such as those from E. coli, Bacillus or Streptomyces,
including pBluescript, pGEX-2T, pUC vectors, col E1, pCR1, pBR322, pMB9 and their
derivatives, wider host range plasmids, such as RP4, phage DNAs, e.g., the numerous
derivatives of phage lambda, e.g., NM989, λGT10 and λGT11, and other phages, e.g.,
M13 and filamentous single-stranded phage DNA. Where E. coli is used as host,
selectable markers are, analogously, chosen for selectivity in gram-negative bacteria: e.g.,
typical markers confer resistance to antibiotics, such as ampicillin, tetracycline,
chloramphenicol, kanamycin, streptomycin and zeocin; auxotrophic markers can also be
used.

In other embodiments, eukaryotic host cells, such as yeast, insect, mammalian or
plant cells, may be used. Yeast cells, typically S. cerevisiae, are useful for eukaryotic
genetic studies, due to the ease of targeting genetic changes by homologous
recombination and the ability to easily complement genetic defects using recombinantly
expressed proteins. Yeast cells are useful for identifying interacting protein components,
e.g., through use of a two-hybrid system. In a preferred embodiment, yeast cells are
useful for protein expression. Vectors of the present invention for use in yeast will
typically, but not invariably, contain an origin of replication suitable for use in yeast and
a selectable marker that is functional in yeast. Yeast vectors include Yeast Integrating
plasmids (e.g., YIp5) and Yeast Replicating plasmids (the YRp and YEp series
plasmids), Yeast Centromere plasmids (the YCp series plasmids), Yeast Artificial
Chromosomes (YACs) which are based on yeast linear plasmids, denoted YLp, pGPD-2,
2µ plasmids and derivatives thereof, and improved shuttle vectors such as those
described in Gietz et al., Gene, 74: 527-34 (1988) (YIplac, YEplac and YCplac).
Selectable markers in yeast vectors include a variety of auxotrophic markers, the most
common of which are (in Saccharomyces cerevisiae) URA3, HIS3, LEU2, TRP1 and
LYS2, which complement specific auxotrophic mutations, such as ura3-52, his3-D1,
leu2-D1, trp1-D1 and lys2-201.

Insect cells are often chosen for high efficiency protein expression. Where the
host cells are from Spodoptera frugiperda (e.g., Sf9 and Sf21 cell lines, and expresSF™
cells (Protein Sciences Corp., Meriden, CT, USA)), the vector replicative strategy is
typically based upon the baculovirus life cycle. Typically, baculovirus transfer vectors
are used to replace the wild-type AcMNPV polyhedrin gene with a heterologous gene of
interest. Sequences that flank the polyhedrin gene in the wild-type genome are
positioned 5' and 3' of the expression cassette on the transfer vectors. Following
co-transfection with AcMNPV DNA, a homologous recombination event occurs between
these sequences resulting in a recombinant virus carrying the gene of interest and the
polyhedrin or p10 promoter. Selection can be based upon visual screening for lacZ
fusion activity.

In another embodiment, the host cells may be mammalian cells, which are
particularly useful for expression of proteins intended as pharmaceutical agents, and for
screening of potential agonists and antagonists of a protein or a physiological pathway.
Mammalian vectors intended for autonomous extrachromosomal replication will
typically include a viral origin, such as the SV40 origin (for replication in cell lines
expressing the large T-antigen, such as COS1 and COS7 cells), the papillomavirus origin,
or the EBV origin for long term episomal replication (for use, e.g., in 293-EBNA cells,
which constitutively express the EBV EBNA-1 gene product and adenovirus E1A).
Vectors intended for integration, and thus replication as part of the mammalian
chromosome, can, but need not, include an origin of replication functional in mammalian
cells, such as the SV40 origin. Vectors based upon viruses, such as adenovirus,
adeno-associated virus, vaccinia virus, and various mammalian retroviruses, will typically
replicate according to the viral replicative strategy. Selectable markers for use in
mammalian cells include resistance to neomycin (G418), blasticidin, hygromycin and
zeocin, and selection based upon the purine salvage pathway using HAT medium.

Expression in mammalian cells can be achieved using a variety of plasmids,
including pSV2, pBC12BI, and p91023, as well as lytic virus vectors (e.g., vaccinia
virus, adenovirus, and baculovirus), episomal virus vectors (e.g., bovine papillomavirus),
and retroviral vectors (e.g., murine retroviruses). Useful vectors for insect cells include
baculoviral vectors and pVL 941.

Plant cells can also be used for expression, with the vector replicon typically
derived from a plant virus (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus,
TMV) and selectable markers chosen for suitability in plants.

It is known that codon usage of different host cells may be different. For
example, a plant cell and a human cell may exhibit a difference in codon preference for
encoding a particular amino acid. As a result, human mRNA may not be efficiently
translated in a plant, bacterial or insect host cell. Therefore, another embodiment of this
invention is directed to codon optimization. The codons of the nucleic acid molecules of
the invention may be modified to resemble, as much as possible, genes naturally
contained within the host cell without altering the amino acid sequence encoded by the
nucleic acid molecule.
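
A minimal sketch of such codon substitution is given below in Python. It assumes the
standard genetic code and a caller-supplied table of preferred codons; the table used in
the example is hypothetical and does not represent measured codon usage for any host.
The function names are likewise illustrative. Each replacement is checked so that the
encoded amino acid sequence is never altered.

```python
from itertools import product
from typing import Dict

# Standard genetic code, codons enumerated in TCAG order.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE: Dict[str, str] = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (whole codons) into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence is not a whole number of codons")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def optimize_codons(cds: str, preferred: Dict[str, str]) -> str:
    """Replace each codon with the host-preferred synonym, where one is given."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence is not a whole number of codons")
    out = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TABLE.get(new_codon) != amino_acid:
            raise ValueError("preferred codon does not encode " + amino_acid)
        out.append(new_codon)
    return "".join(out)


if __name__ == "__main__":
    hypothetical_preferences = {"L": "CTG", "K": "AAA"}   # illustrative only
    original = "ATGTTAAAGTAA"
    optimized = optimize_codons(original, hypothetical_preferences)
    print(optimized, translate(optimized) == translate(original))
```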

Any of a wide variety of expression control sequences may be used in these
vectors to express the DNA sequences of this invention. Such useful expression control
sequences include the expression control sequences associated with structural genes of
the foregoing expression vectors. Expression control sequences that control transcription
include, e.g., promoters, enhancers and transcription termination sites. Expression
control sequences in eukaryotic cells that control post-transcriptional events include
splice donor and acceptor sites and sequences that modify the half-life of the transcribed
RNA, e.g., sequences that direct poly(A) addition or binding sites for RNA-binding
proteins. Expression control sequences that control translation include ribosome binding
sites, sequences which direct targeted expression of the polypeptide to or within
particular cellular compartments, and sequences in the 5' and 3' untranslated regions that
modify the rate or efficiency of translation.

Examples of useful expression control sequences for a prokaryote, e.g., E. coli,
will include a promoter, often a phage promoter, such as phage lambda pL promoter, the
trc promoter, a hybrid derived from the trp and lac promoters, the bacteriophage T7
promoter (in E. coli cells engineered to express the T7 polymerase), the TAC or TRC
system, the major operator and promoter regions of phage lambda, the control regions of
fd coat protein, or the araBAD operon. Prokaryotic expression vectors may further
include transcription terminators, such as the aspA terminator, and elements that facilitate
translation, such as a consensus ribosome binding site and translation termination codon,
Schoner et al., Proc. Natl. Acad. Sci. USA 83: 8506-8510 (1986).

Expression control sequences for yeast cells, typically S. cerevisiae, will include a
yeast promoter, such as the CYC1 promoter, the GAL1 promoter, the GAL10 promoter,
ADH1 promoter, the promoters of the yeast α-mating system, or the GPD promoter, and
will typically have elements that facilitate transcription termination, such as the
transcription termination signals from the CYC1 or ADH1 gene.

Expression vectors useful for expressing proteins in mammalian cells will include
a promoter active in mammalian cells. These promoters include those derived from
mammalian viruses, such as the enhancer-promoter sequences from the immediate early
gene of the human cytomegalovirus (CMV), the enhancer-promoter sequences from the
Rous sarcoma virus long terminal repeat (RSV LTR), the enhancer-promoter from SV40
or the early and late promoters of adenovirus. Other expression control sequences
include the promoter for 3-phosphoglycerate kinase or other glycolytic enzymes, and the
promoters of acid phosphatase. Other expression control sequences include those from
the gene comprising the LSNA of interest. Often, expression is enhanced by
incorporation of polyadenylation sites, such as the late SV40 polyadenylation site and the
polyadenylation signal and transcription termination sequences from the bovine growth
hormone (BGH) gene, and ribosome binding sites. Furthermore, vectors can include
introns, such as intron II of rabbit β-globin gene and the SV40 splice elements.

Preferred nucleic acid vectors also include a selectable or amplifiable marker
gene and means for amplifying the copy number of the gene of interest. Such marker
genes are well-known in the art. Nucleic acid vectors may also comprise stabilizing
sequences (e.g., ori- or ARS-like sequences and telomere-like sequences), or may
alternatively be designed to favor directed or non-directed integration into the host cell
genome. In a preferred embodiment, nucleic acid sequences of this invention are inserted
in frame into an expression vector that allows high level expression of an RNA which
encodes a protein comprising the encoded nucleic acid sequence of interest. Nucleic acid
cloning and sequencing methods are well-known to those of skill in the art and are
described in an assortment of laboratory manuals, including Sambrook (1989), supra;
Sambrook (2000), supra; Ausubel (1992), supra; and Ausubel (1999), supra. Product
information from manufacturers of biological, chemical and immunological reagents also
provides useful information.

Expression vectors may be either constitutive or inducible. Inducible vectors
include either naturally inducible promoters, such as the trc promoter, which is regulated
by the lac operon, and the pL promoter, which is regulated by tryptophan, the 
MMTV-LTR promoter, which is inducible by dexamethasone, or can contain synthetic 
promoters and/or additional elements that confer inducible control on adjacent promoters. 
Examples of inducible synthetic promoters are the hybrid Plac/ara-1 promoter and the
PLtetO-1 promoter. The PLtetO-1 promoter takes advantage of the high expression levels
from the PL promoter of phage lambda, but replaces the lambda repressor sites with two
copies of operator 2 of the Tn10 tetracycline resistance operon, causing this promoter to
be tightly repressed by the Tet repressor protein and induced in response to tetracycline
(Tc) and Tc derivatives such as anhydrotetracycline. Vectors may also be inducible
because they contain hormone response elements, such as the glucocorticoid response 
element (GRE) and the estrogen response element (ERE), which can confer hormone
inducibility where vectors are used for expression in cells having the respective hormone
receptors. To reduce background levels of expression, elements responsive to ecdysone, 
an insect hormone, can be used instead, with coexpression of the ecdysone receptor. 

In one aspect of the invention, expression vectors can be designed to fuse the 
expressed polypeptide to small protein tags that facilitate purification and/or
visualization. Tags that facilitate purification include a polyhistidine tag that facilitates
purification of the fusion protein by immobilized metal affinity chromatography, for
example using Ni-NTA resin (Qiagen Inc., Valencia, CA, USA) or TALON™ resin
(cobalt immobilized affinity chromatography medium, Clontech Labs, Palo Alto, CA, 
USA). The fusion protein can include a chitin-binding tag and self-excising intein,
permitting chitin-based purification with self-removal of the fused tag (IMPACT™
system, New England Biolabs, Inc., Beverly, MA, USA). Alternatively, the fusion
protein can include a calmodulin-binding peptide tag, permitting purification by
calmodulin affinity resin (Stratagene, La Jolla, CA, USA), or a specifically excisable
fragment of the biotin carboxylase carrier protein, permitting purification of in vivo
biotinylated protein using an avidin resin and subsequent tag removal (Promega,
Madison, WI, USA). As another useful alternative, the proteins of the present invention
can be expressed as a fusion protein with glutathione-S-transferase, the affinity and 
specificity of binding to glutathione permitting purification using glutathione affinity 
resins, such as Glutathione-Superflow Resin (Clontech Laboratories, Palo Alto, CA,
USA), with subsequent elution with free glutathione. Other tags include, for example,
the Xpress epitope, detectable by anti-Xpress antibody (Invitrogen, Carlsbad, CA, USA),
a myc tag, detectable by anti-myc tag antibody, the V5 epitope, detectable by anti-V5
antibody (Invitrogen, Carlsbad, CA, USA), the FLAG® epitope, detectable by anti-FLAG®
antibody (Stratagene, La Jolla, CA, USA), and the HA epitope.
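
By way of illustration only, the in-frame fusion of a purification tag to an open reading frame can be sketched as follows. The function name, the choice of the CAT codon for histidine and the TAA stop codon are assumptions made for the example; they are not taken from any particular vector described herein, and commercial vectors supply the tag from the vector backbone rather than from the insert.

```python
# Illustrative sketch: append a C-terminal hexahistidine tag in frame.
# HIS6, STOP_CODONS and add_c_terminal_his_tag are hypothetical names.
HIS6 = "CAT" * 6                      # six histidine codons (CAT encodes His)
STOP_CODONS = {"TAA", "TAG", "TGA"}   # standard genetic code stop codons


def add_c_terminal_his_tag(orf: str) -> str:
    """Return the ORF with its stop codon replaced by 6xHis plus a TAA stop."""
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3 to stay in frame")
    codons = [orf[i:i + 3] for i in range(0, len(orf), 3)]
    # Drop a terminal stop codon so that translation reads through into the tag.
    if codons and codons[-1] in STOP_CODONS:
        codons = codons[:-1]
    # An internal stop codon would terminate translation before the tag.
    if any(codon in STOP_CODONS for codon in codons):
        raise ValueError("internal stop codon found; tag would not be expressed")
    return "".join(codons) + HIS6 + "TAA"


if __name__ == "__main__":
    print(add_c_terminal_his_tag("ATGGCTTAA"))
```

The two checks (length divisible by three, no internal stop codon) are the minimum needed for the tag to be translated as part of the fusion protein; they do not address codon usage or linker design.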

For secretion of expressed proteins, vectors can include appropriate sequences
that encode secretion signals, such as leader peptides. For example, the pSecTag2
vectors (Invitrogen, Carlsbad, CA, USA) are 5.2 kb mammalian expression vectors that
carry the secretion signal from the V-J2-C region of the mouse Ig kappa-chain for
efficient secretion of recombinant proteins from a variety of mammalian cell lines.

Expression vectors can also be designed to fuse proteins encoded by the 
heterologous nucleic acid insert to polypeptides that are larger than purification and/or
identification tags. Useful fusion proteins include those that permit display of the
encoded protein on the surface of a phage or cell, fusion to intrinsically fluorescent 
proteins, such as those that have a green fluorescent protein (GFP)-like chromophore, 
fusions to the IgG Fc region, and fusion proteins for use in two hybrid systems. 

Vectors for phage display fuse the encoded polypeptide to, e.g., the gene III
protein (pIII) or gene VIII protein (pVIII) for display on the surface of filamentous
phage, such as M13. See Barbas et al., Phage Display: A Laboratory Manual, Cold
Spring Harbor Laboratory Press (2001); Kay et al. (eds.), Phage Display of Peptides and
Proteins: A Laboratory Manual, Academic Press, Inc., (1996); Abelson et al. (eds.),
Combinatorial Chemistry (Methods in Enzymology, Vol. 267), Academic Press (1996).
Vectors for yeast display, e.g., the pYD1 yeast display vector (Invitrogen, Carlsbad, CA,
USA), use the α-agglutinin yeast adhesion receptor to display recombinant protein on the
surface of S. cerevisiae. Vectors for mammalian display, e.g., the pDisplay™ vector
(Invitrogen, Carlsbad, CA, USA), target recombinant proteins using an N-terminal cell
surface targeting signal and a C-terminal transmembrane anchoring domain of platelet
derived growth factor receptor.

A wide variety of vectors now exist that fuse proteins encoded by heterologous 
nucleic acids to the chromophore of the substrate-independent, intrinsically fluorescent 
green fluorescent protein from Aequorea victoria ("GFP") and its variants. The GFP-like 
chromophore can be selected from GFP-like chromophores found in naturally occurring
proteins, such as A. victoria GFP (GenBank accession number AAA27721), Renilla
reniformis GFP, FP583 (GenBank accession no. AF168419) (DsRed), FP593
(AF272711), FP483 (AF168420), FP484 (AF168424), FP595 (AF246709), FP486
(AF168421), FP538 (AF168423), and FP506 (AF168422), and need include only so
much of the native protein as is needed to retain the chromophore's intrinsic
fluorescence. Methods for determining the minimal domain required for fluorescence are
known in the art. See Li et al., J. Biol. Chem. 272: 28545-28549 (1997). Alternatively,
the GFP-like chromophore can be selected from GFP-like chromophores modified from
those found in nature. The methods for engineering such modified GFP-like
chromophores and testing them for fluorescence activity, both alone and as part of
protein fusions, are well-known in the art. See Heim et al., Curr. Biol. 6: 178-182 (1996)
and Palm et al., Methods Enzymol. 302: 378-394 (1999), incorporated herein by
reference in its entirety. A variety of such modified chromophores are now
commercially available and can readily be used in the fusion proteins of the present
invention. These include EGFP ("enhanced GFP"), EBFP ("enhanced blue fluorescent
protein"), BFP2, EYFP ("enhanced yellow fluorescent protein"), ECFP ("enhanced cyan
fluorescent protein") or Citrine. EGFP (see, e.g., Cormack et al., Gene 173: 33-38
(1996); United States Patent Nos. 6,090,919 and 5,804,387) is found on a variety of
vectors, both plasmid and viral, which are available commercially (Clontech Labs, Palo
Alto, CA, USA); EBFP is optimized for expression in mammalian cells whereas BFP2,
which retains the original jellyfish codons, can be expressed in bacteria (see, e.g., Heim
et al., Curr. Biol. 6: 178-182 (1996) and Cormack et al., Gene 173: 33-38 (1996)).

Vectors containing these blue-shifted variants are available from Clontech Labs (Palo
Alto, CA, USA). Vectors containing EYFP, ECFP (see, e.g., Heim et al., Curr. Biol. 6:
178-182 (1996); Miyawaki et al., Nature 388: 882-887 (1997)) and Citrine (see, e.g.,
Heikal et al., Proc. Natl. Acad. Sci. USA 97: 11996-12001 (2000)) are also available
from Clontech Labs. The GFP-like chromophore can also be drawn from other modified
GFPs, including those described in U.S. Patents 6,124,128; 6,096,865; 6,090,919;
6,066,476; 6,054,321; 6,027,881; 5,968,750; 5,874,304; 5,804,387; 5,777,079;
5,741,668; and 5,625,048, the disclosures of which are incorporated herein by reference
in their entireties. See also Conn (ed.), Green Fluorescent Protein (Methods in
Enzymology, Vol. 302), Academic Press, Inc. (1999). The GFP-like chromophore of
each of these GFP variants can usefully be included in the fusion proteins of the present
invention. 

Fusions to the IgG Fc region increase serum half-life of protein pharmaceutical
products through interaction with the FcRn receptor (also denominated the FcRp receptor
and the Brambell receptor, FcRb), further described in International Patent Application
Nos. WO 97/43316, WO 97/34631, WO 96/32478, WO 96/18412.

For long-term, high-yield recombinant production of the proteins, protein fusions, 
and protein fragments of the present invention, stable expression is preferred. Stable
expression is readily achieved by integration into the host cell genome of vectors having
selectable markers, followed by selection of these integrants. Vectors such as
pUB6/V5-His A, B, and C (Invitrogen, Carlsbad, CA, USA) are designed for high-level
stable expression of heterologous proteins in a wide range of mammalian tissue types and
cell lines. pUB6/V5-His uses the promoter/enhancer sequence from the human ubiquitin
C gene to drive expression of recombinant proteins: expression levels in 293, CHO, and
NIH3T3 cells are comparable to levels from the CMV and human EF-1α promoters. The
bsd gene permits rapid selection of stably transfected mammalian cells with the potent 
antibiotic blasticidin. 

Replication incompetent retroviral vectors, typically derived from Moloney
murine leukemia virus, also are useful for creating stable transfectants having integrated
provirus. The highly efficient transduction machinery of retroviruses, coupled with the
availability of a variety of packaging cell lines such as RetroPack™ PT 67,
EcoPack2™-293, AmphoPack-293, and GP2-293 cell lines (all available from Clontech
Laboratories, Palo Alto, CA, USA), allows a wide host range to be infected with high
efficiency; varying the multiplicity of infection readily adjusts the copy number of the
integrated provirus.
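
The relationship between multiplicity of infection (MOI) and provirus copy number can be approximated numerically. The sketch below assumes, purely for illustration, that integration events per cell follow a Poisson distribution whose mean equals the effective MOI; this statistical model is a common simplification and is not a teaching of this specification, and the function names are hypothetical.

```python
import math

# Illustrative sketch under an ASSUMED Poisson model of integration events.
# "moi" is the assumed mean number of integrating particles per cell.


def poisson_copy_fraction(moi: float, copies: int) -> float:
    """Fraction of cells expected to carry exactly `copies` proviruses."""
    if moi < 0 or copies < 0:
        raise ValueError("moi and copies must be non-negative")
    return math.exp(-moi) * moi ** copies / math.factorial(copies)


def fraction_transduced(moi: float) -> float:
    """Fraction of cells expected to carry at least one provirus."""
    return 1.0 - math.exp(-moi)


def single_copy_among_transduced(moi: float) -> float:
    """Among transduced cells, the fraction expected to carry exactly one copy."""
    if moi <= 0:
        raise ValueError("moi must be positive")
    return poisson_copy_fraction(moi, 1) / fraction_transduced(moi)


if __name__ == "__main__":
    for moi in (0.1, 0.5, 1.0, 3.0):
        print(moi, round(fraction_transduced(moi), 3),
              round(single_copy_among_transduced(moi), 3))
```

Under this assumption a low MOI favors single-copy integrants at the cost of fewer transduced cells, whereas a higher MOI raises both the transduced fraction and the average copy number.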

Of course, not all vectors and expression control sequences will function equally 
well to express the nucleic acid sequences of this invention. Neither will all hosts
function equally well with the same expression system. However, one of skill in the art
may make a selection among these vectors, expression control sequences and hosts 
without undue experimentation and without departing from the scope of this invention. 
For example, in selecting a vector, the host must be considered because the vector must 
be replicated in it. The vector's copy number, the ability to control that copy number, the
ability to control integration, if any, and the expression of any other proteins encoded by
the vector, such as antibiotic or other selection markers, should also be considered. The 
present invention further includes host cells comprising the vectors of the present 
invention, either present episomally within the cell or integrated, in whole or in part, into 
the host cell chromosome. Among other considerations, some of which are described
above, a host cell strain may be chosen for its ability to process the expressed protein in
the desired fashion. Such post-translational modifications of the polypeptide include, but
are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation,
and acylation, and it is an aspect of the present invention to provide LSPs with such
post-translational modifications.

Polypeptides of the invention may be post-translationally modified.
Post-translational modifications include phosphorylation of amino acid residues serine,
threonine and/or tyrosine, N-linked and/or O-linked glycosylation, methylation,
acetylation, prenylation, arginylation, ubiquitination and
racemization. One may determine whether a polypeptide of the invention is likely to be
post-translationally modified by analyzing the sequence of the polypeptide to determine
if there are peptide motifs indicative of sites for post-translational modification. There
are a number of computer programs that permit prediction of post-translational
modifications. See, e.g., www.expasy.org (accessed August 31, 2001), which includes
PSORT, for prediction of protein sorting signals and localization sites, SignalP, for
prediction of signal peptide cleavage sites, MITOPROT and Predotar, for prediction of
mitochondrial targeting sequences, NetOGlyc, for prediction of type O-glycosylation
sites in mammalian proteins, big-PI Predictor and DGPI, for prediction of
prenylation-anchor and cleavage sites, and NetPhos, for prediction of Ser, Thr and Tyr
phosphorylation sites in eukaryotic proteins. Other computer programs, such as those
included in GCG, also may be used to determine post-translational modification peptide
motifs.
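
As a minimal illustration of the kind of motif analysis described above, the following sketch scans an amino acid sequence for two widely used consensus patterns: the N-glycosylation sequon N-{P}-[ST]-{P} and the protein kinase C phosphorylation pattern [ST]-x-[RK]. It is not the algorithm used by any of the programs named above, which rely on more elaborate models; the function name and the example sequence are hypothetical, and a pattern match indicates only a candidate site.

```python
import re

# Consensus patterns written as regular expressions (illustrative selection).
MOTIFS = {
    "N-glycosylation": r"N[^P][ST][^P]",        # Asn, not Pro, Ser/Thr, not Pro
    "PKC phosphorylation": r"[ST].[RK]",        # Ser/Thr, any residue, Arg/Lys
}


def find_motifs(protein: str, motifs: dict = MOTIFS) -> dict:
    """Return {motif name: [1-based start positions]}, overlapping hits included."""
    protein = protein.upper()
    hits = {}
    for name, pattern in motifs.items():
        # A zero-width lookahead lets overlapping occurrences be reported.
        regex = re.compile("(?=" + pattern + ")")
        hits[name] = [m.start() + 1 for m in regex.finditer(protein)]
    return hits


if __name__ == "__main__":
    # Hypothetical peptide; Asn-8 is followed by Pro and is therefore not reported.
    print(find_motifs("MKNGSAPNPTLSVRK"))
```

Candidate sites found in this way would ordinarily be confirmed experimentally, for example by the antibody-based or electrophoretic methods discussed elsewhere in this specification.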

General examples of types of post-translational modifications may be found in
web sites such as the Delta Mass database
http://www.abrf.org/ABRF/ResearchCommittees/deltamass/deltamass.html (accessed
October 19, 2001); "GlycoSuiteDB: a new curated relational database of glycoprotein
glycan structures and their biological sources" Cooper et al., Nucleic Acids Res. 29:
332-335 (2001) and http://www.glycosuite.com/ (accessed October 19, 2001);
"O-GLYCBASE version 4.0: a revised database of O-glycosylated proteins" Gupta et al.,
Nucleic Acids Res. 27: 370-372 (1999) and
http://www.cbs.dtu.dk/databases/OGLYCBASE/ (accessed October 19, 2001);
"PhosphoBase, a database of phosphorylation sites: release 2.0", Kreegipuu et al.,
Nucleic Acids Res. 27(1): 237-239 (1999) and
http://www.cbs.dtu.dk/databases/PhosphoBase/ (accessed October 19, 2001); or
http://pir.georgetown.edu/pirwww/search/textresid.html (accessed October 19, 2001).

Tumorigenesis is often accompanied by alterations in the post-translational
modifications of proteins. Thus, in another embodiment, the invention provides 
polypeptides from cancerous cells or tissues that have altered post-translational 
modifications compared to the post-translational modifications of polypeptides from 
normal cells or tissues. A number of altered post-translational modifications are known.
One common alteration is a change in phosphorylation state, wherein the polypeptide
from the cancerous cell or tissue is hyperphosphorylated or hypophosphorylated
compared to the polypeptide from a normal tissue, or wherein the polypeptide is
phosphorylated on different residues than the polypeptide from a normal cell. Another
common alteration is a change in glycosylation state, wherein the polypeptide from the
cancerous cell or tissue has more or less glycosylation than the polypeptide from a
normal tissue, and/or wherein the polypeptide from the cancerous cell or tissue has a
different type of glycosylation than the polypeptide from a noncancerous cell or tissue.
Changes in glycosylation may be critical because carbohydrate-protein and
carbohydrate-carbohydrate interactions are important in cancer cell progression,
dissemination and invasion. See, e.g., Barchi, Curr. Pharm. Des. 6: 485-501 (2000),
Verma, Cancer Biochem. Biophys. 14: 151-162 (1994) and Dennis et al., Bioessays 5:
412-421 (1999).

Another post-translational modification that may be altered in cancer cells is
prenylation. Prenylation is the covalent attachment of a hydrophobic prenyl group (either
farnesyl or geranylgeranyl) to a polypeptide. Prenylation is required for localizing a
protein to a cell membrane and is often required for polypeptide function. For instance,
the Ras superfamily of GTPase signaling proteins must be prenylated for function in a
cell. See, e.g., Prendergast et al., Semin. Cancer Biol. 10: 443-452 (2000) and Khwaja et
al., Lancet 355: 741-744 (2000).

Other post-translational modifications that may be altered in cancer cells include,
without limitation, polypeptide methylation, acetylation, arginylation or racemization of
amino acid residues. In these cases, the polypeptide from the cancerous cell may exhibit 
either increased or decreased amounts of the post-translational modification compared to 
the corresponding polypeptides from noncancerous cells. 

Other polypeptide alterations in cancer cells include abnormal polypeptide
cleavage and aberrant protein-protein interactions. Abnormal polypeptide
cleavage may be cleavage of a polypeptide in a cancerous cell that does not usually occur
in a normal cell, or a lack of cleavage in a cancerous cell, wherein the polypeptide is
cleaved in a normal cell. Aberrant protein-protein interactions may be either covalent
cross-linking or non-covalent binding between proteins that do not normally bind to each
other. Alternatively, in a cancerous cell, a protein may fail to bind to another protein to
which it is bound in a noncancerous cell. Alterations in cleavage or in protein-protein
interactions may be due to over- or underproduction of a polypeptide in a cancerous cell
compared to that in a normal cell, or may be due to alterations in post-translational
modifications (see above) of one or more proteins in the cancerous cell. See, e.g.,
Henschen-Edman, Ann. N.Y. Acad. Sci. 936: 580-593 (2001).

Alterations in polypeptide post-translational modifications, as well as changes in
polypeptide cleavage and protein-protein interactions, may be determined by any method
known in the art. For instance, alterations in phosphorylation may be determined by 
using anti-phosphoserine, anti-phosphothreonine or anti-phosphotyrosine antibodies or 
by amino acid analysis. Glycosylation alterations may be determined using antibodies
specific for different sugar residues, by carbohydrate sequencing, or by alterations in the
size of the glycoprotein, which can be determined by, e.g., SDS polyacrylamide gel 
electrophoresis (PAGE). Other alterations of post-translational modifications, such as 
prenylation, racemization, methylation, acetylation and arginylation, may be determined 
by chemical analysis, protein sequencing, amino acid analysis, or by using antibodies
specific for the particular post-translational modifications. Changes in protein-protein
interactions and in polypeptide cleavage may be analyzed by any method known in the 
art including, without limitation, non-denaturing PAGE (for non-covalent protein-protein 
interactions), SDS PAGE (for covalent protein-protein interactions and protein cleavage), 
chemical cleavage, protein sequencing or immunoassays. 

In another embodiment, the invention provides polypeptides that have been
post-translationally modified. In one embodiment, polypeptides may be modified
enzymatically or chemically, by addition or removal of a post-translational modification. 
For example, a polypeptide may be glycosylated or deglycosylated enzymatically. 
Similarly, polypeptides may be phosphorylated using a purified kinase, such as a MAP
kinase (e.g., p38, ERK, or JNK) or a tyrosine kinase (e.g., Src or erbB2). A polypeptide
may also be modified through synthetic chemistry. Alternatively, one may isolate the
polypeptide of interest from a cell or tissue that expresses the polypeptide with the
desired post-translational modification. In another embodiment, a nucleic acid molecule
encoding the polypeptide of interest is introduced into a host cell that is capable of
post-translationally modifying the encoded polypeptide in the desired fashion. If the
polypeptide does not contain a motif for a desired post-translational modification, one
may alter the post-translational modification by mutating the nucleic acid sequence of a
nucleic acid molecule encoding the polypeptide so that it contains a site for the desired
post-translational modification. Amino acid sequences that may be post-translationally
modified are known in the art. See, e.g., the programs described above on the website
www.expasy.org. The nucleic acid molecule is then introduced into a host cell that is
capable of post-translationally modifying the encoded polypeptide. Similarly, one may
delete sites that are post-translationally modified by either mutating the nucleic acid 
sequence so that the encoded polypeptide does not contain the post-translational 
modification motif, or by introducing the native nucleic acid molecule into a host cell 
that is not capable of post-translationally modifying the encoded polypeptide. 
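
The mutational approach described above can be illustrated in silico. The sketch below replaces a single codon of a short, hypothetical coding sequence so that the encoded peptide acquires an N-glycosylation sequon (N-{P}-[ST]-{P}). The coding sequence, the codon choices and the function names are assumptions made for the example; the translation table is the standard genetic code.

```python
import re
from itertools import product

# Standard genetic code, built in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One codon per residue used for substitutions; the choice is arbitrary here.
PREFERRED_CODON = {"N": "AAC", "S": "AGC", "T": "ACC"}
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def substitute_residue(cds: str, residue_number: int, new_aa: str) -> str:
    """Replace the codon of the 1-based residue_number with a codon for new_aa."""
    cds = cds.upper()
    start = 3 * (residue_number - 1)
    if len(cds) % 3 != 0 or start < 0 or start + 3 > len(cds):
        raise ValueError("residue number outside an in-frame coding sequence")
    return cds[:start] + PREFERRED_CODON[new_aa] + cds[start + 3:]


def sequon_positions(protein: str) -> list:
    """1-based positions of Asn residues that begin an N-glycosylation sequon."""
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


if __name__ == "__main__":
    wild_type = "ATGAAAGGCGCGTCTCTG"                  # hypothetical, encodes MKGASL
    mutant = substitute_residue(wild_type, 3, "N")    # Gly-3 to Asn
    print(translate(wild_type), sequon_positions(translate(wild_type)))
    print(translate(mutant), sequon_positions(translate(mutant)))
```

The same helper can be run in the opposite direction, replacing the Asn or Ser/Thr codon of an existing sequon, to model deletion of a modification site. Whether an introduced site is actually modified depends on the host cell and on the structural context of the site.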

In selecting an expression control sequence, a variety of factors should also be
considered. These include, for example, the relative strength of the sequence, its
controllability, and its compatibility with the nucleic acid sequence of this invention,
particularly with regard to potential secondary structures. Unicellular hosts should be
selected by consideration of their compatibility with the chosen vector, the toxicity of the
product coded for by the nucleic acid sequences of this invention, their secretion
characteristics, their ability to fold the polypeptide correctly, their fermentation or culture
requirements, and the ease of purification from them of the products coded for by the 
nucleic acid sequences of this invention. 

The recombinant nucleic acid molecules and, more particularly, the expression
vectors of this invention may be used to express the polypeptides of this invention as
recombinant polypeptides in a heterologous host cell. The polypeptides of this invention
may be full-length or less than full-length polypeptide fragments recombinantly
expressed from the nucleic acid sequences according to this invention. Such
polypeptides include analogs, derivatives and muteins that may or may not have
biological activity.

Vectors of the present invention will also often include elements that permit in 
vitro transcription of RNA from the inserted heterologous nucleic acid. Such vectors
typically include a phage promoter, such as that from T7, T3, or SP6, flanking the nucleic
acid insert. Often two different such promoters flank the inserted nucleic acid, permitting 
separate in vitro production of both sense and antisense strands. 

Transformation and other methods of introducing nucleic acids into a host cell
(e.g., conjugation, protoplast transformation or fusion, transfection, electroporation,
liposome delivery, membrane fusion techniques, high velocity DNA-coated pellets, and
viral infection) can be accomplished by a variety of methods which are
well-known in the art (see, for instance, Ausubel, supra, and Sambrook et al., supra).
Bacterial, yeast, plant or mammalian cells are transformed or transfected with an
expression vector, such as a plasmid, a cosmid, or the like, wherein the expression vector
comprises the nucleic acid of interest. Alternatively, the cells may be infected by a viral 
expression vector comprising the nucleic acid of interest. Depending upon the host cell, 
vector, and method of transformation used, transient or stable expression of the 
polypeptide will be constitutive or inducible. One having ordinary skill in the art will be
able to decide whether to express a polypeptide transiently or stably, and whether to
express the protein constitutively or inducibly. 

A wide variety of unicellular host cells are useful in expressing the DNA 
sequences of this invention. These hosts may include well-known eukaryotic and 
prokaryotic hosts, such as strains of fungi, yeast, insect cells such as Spodoptera
frugiperda (Sf9), animal cells such as CHO, as well as plant cells in tissue culture.
Representative examples of appropriate host cells include, but are not limited to, bacterial
cells, such as E. coli, Caulobacter crescentus, Streptomyces species, and Salmonella 
typhimurium; yeast cells, such as Saccharomyces cerevisiae, Schizosaccharomyces 
pombe, Pichia pastoris, Pichia methanolica; insect cell lines, such as those from
Spodoptera frugiperda, e.g., Sf9 and Sf21 cell lines, and expresSF™ cells (Protein
Sciences Corp., Meriden, CT, USA), Drosophila S2 cells, and Trichoplusia ni High 
Five® Cells (Invitrogen, Carlsbad, CA, USA); and mammalian cells. Typical 
mammalian cells include BHK cells, BSC 1 cells, BSC 40 cells, BMT 10 cells, VERO 
cells, COS1 cells, COS7 cells, Chinese hamster ovary (CHO) cells, 3T3 cells, NIH 3T3
cells, 293 cells, HEPG2 cells, HeLa cells, L cells, MDCK cells, HEK293 cells, WI38
cells, murine ES cell lines (e.g., from strains 129/SV, C57/BL6, DBA-1, 129/SVJ), K562
cells, Jurkat cells, and BW5147 cells. Other mammalian cell lines are well-known and
readily available from the American Type Culture Collection (ATCC) (Manassas, VA,
USA) and the National Institute of General Medical Sciences (NIGMS) Human Genetic
Cell Repository at the Coriell Cell Repositories (Camden, NJ, USA). Cells or cell lines
derived from lung are particularly preferred because they may provide a more native
post-translational processing. Particularly preferred are human lung cells.

Particular details of the transfection, expression and purification of recombinant 
proteins are well documented and are understood by those of skill in the art. Further 
details on the various technical aspects of each of the steps used in recombinant 
production of foreign genes in bacterial cell expression systems can be found in a number
of texts and laboratory manuals in the art. See, e.g., Ausubel (1992), supra, Ausubel
(1999), supra, Sambrook (1989), supra, and Sambrook (2001), supra, herein
incorporated by reference.

Methods for introducing the vectors and nucleic acids of the present invention 
into the host cells are well-known in the art; the choice of technique will depend
primarily upon the specific vector to be introduced and the host cell chosen.

Nucleic acid molecules and vectors may be introduced into prokaryotes, such as 
E. coli, in a number of ways. For instance, phage lambda vectors will typically be 
packaged using a packaging extract (e.g., Gigapack® packaging extract, Stratagene, La 
Jolla, CA, USA), and the packaged virus used to infect E. coli. 

Plasmid vectors will typically be introduced into chemically competent or
electrocompetent bacterial cells. E. coli cells can be rendered chemically competent by
treatment, e.g., with CaCl₂, or a solution of Mg²⁺, Mn²⁺, Ca²⁺, Rb⁺ or K⁺, dimethyl
sulfoxide, dithiothreitol, and hexamine cobalt (III), Hanahan, J. Mol. Biol. 166(4): 557-80
(1983), and vectors introduced by heat shock. A wide variety of chemically competent
strains are also available commercially (e.g., Epicurian Coli® XL10-Gold®
Ultracompetent Cells (Stratagene, La Jolla, CA, USA); DH5α competent cells (Clontech
Laboratories, Palo Alto, CA, USA); and TOP10 Chemically Competent E. coli Kit
(Invitrogen, Carlsbad, CA, USA)). Bacterial cells can be rendered electrocompetent, that
is, competent to take up exogenous DNA by electroporation, by various pre-pulse
treatments; vectors are introduced by electroporation followed by subsequent outgrowth
in selected media. An extensive series of protocols is provided online in Electroprotocols
(Bio-Rad, Richmond, CA, USA)
(http://www.bio-rad.com/LifeScience/pdf/New_Gene_Pulser.pdf).

Vectors can be introduced into yeast cells by spheroplasting, treatment with 
lithium salts, electroporation, or protoplast fusion. Spheroplasts are prepared by the 
action of hydrolytic enzymes such as snail-gut extract, usually denoted Glusulase, or
Zymolyase, an enzyme from Arthrobacter luteus, to remove portions of the cell wall in
the presence of osmotic stabilizers, typically 1 M sorbitol. DNA is added to the
spheroplasts, and the mixture is co-precipitated with a solution of polyethylene glycol
(PEG) and Ca²⁺. Subsequently, the cells are resuspended in a solution of sorbitol, mixed
with molten agar and then layered on the surface of a selective plate containing sorbitol.

For lithium-mediated transformation, yeast cells are treated with lithium acetate,
which apparently permeabilizes the cell wall, DNA is added and the cells are
co-precipitated with PEG. The cells are exposed to a brief heat shock, washed free of
PEG and lithium acetate, and subsequently spread on plates containing ordinary selective
medium. Increased frequencies of transformation are obtained by using
specially-prepared single-stranded carrier DNA and certain organic solvents. Schiestl et
al., Curr. Genet. 16(5-6): 339-46 (1989).

For electroporation, freshly-grown yeast cultures are typically washed, suspended
in an osmotic protectant, such as sorbitol, mixed with DNA, and the cell suspension
pulsed in an electroporation device. Subsequently, the cells are spread on the surface of
plates containing selective media. Becker et al., Methods Enzymol. 194: 182-187 (1991).
The efficiency of transformation by electroporation can be increased over 100-fold by 
using PEG, single-stranded carrier DNA and cells that are in late log-phase of growth. 
Larger constructs, such as YACs, can be introduced by protoplast fusion. 

Mammalian and insect cells can be directly infected by packaged viral vectors, or
transfected by chemical or electrical means. For chemical transfection, DNA can be
coprecipitated with CaPO₄ or introduced using liposomal and nonliposomal lipid-based
agents. Commercial kits are available for CaPO₄ transfection (CalPhos™ Mammalian
Transfection Kit, Clontech Laboratories, Palo Alto, CA, USA), and lipid-mediated
transfection can be practiced using commercial reagents, such as LIPOFECTAMINE™
2000, LIPOFECTAMINE™ Reagent, CELLFECTIN® Reagent, and LIPOFECTIN®
Reagent (Invitrogen, Carlsbad, CA, USA), DOTAP Liposomal Transfection Reagent,
FuGENE 6, X-tremeGENE Q2, DOSPER (Roche Molecular Biochemicals, Indianapolis,
IN, USA), and Effectene™, PolyFect®, Superfect® (Qiagen, Inc., Valencia, CA, USA).
Protocols for electroporating mammalian cells can be found online in Electroprotocols
(Bio-Rad, Richmond, CA, USA)
(http://www.bio-rad.com/LifeScience/pdf/New_Gene_Pulser.pdf); Norton et al. (eds.),
Gene Transfer Methods: Introducing DNA into Living Cells and Organisms,
BioTechniques Books, Eaton Publishing Co. (2000);
incorporated herein by reference in its entirety. Other transfection techniques include
transfection by particle bombardment and microinjection. See, e.g., Cheng et al., Proc.
Natl. Acad. Sci. USA 90(10): 4455-9 (1993); Yang et al., Proc. Natl. Acad. Sci. USA
87(24): 9568-72 (1990).

Production of the recombinantly produced proteins of the present invention can 
optionally be followed by purification. 

Purification of recombinantly expressed proteins is now well understood by those skilled
in the art. See, e.g., Thorner et al. (eds.), Applications of Chimeric Genes and Hybrid
Proteins, Part A: Gene Expression and Protein Purification (Methods in Enzymology,
Vol. 326), Academic Press (2000); Harbin (ed.), Cloning, Gene Expression and Protein
Purification: Experimental Procedures and Process Rationale, Oxford Univ. Press
(2001); Marshak et al., Strategies for Protein Purification and Characterization: A
Laboratory Course Manual, Cold Spring Harbor Laboratory Press (1996); and Roe (ed.),
Protein Purification Applications, Oxford University Press (2001); the disclosures of
which are incorporated herein by reference in their entireties, and thus need not be
detailed here.

Briefly, however, if purification tags have been fused through use of an 
expression vector that appends such tags, purification can be effected, at least in part, by 
means appropriate to the tag, such as use of immobilized metal affinity chromatography
for polyhistidine tags. Other techniques common in the art include ammonium sulfate 
fractionation, immunoprecipitation, fast protein liquid chromatography (FPLC), high 
performance liquid chromatography (HPLC), and preparative gel electrophoresis. 

Polypeptides 

Another object of the invention is to provide polypeptides encoded by the nucleic
acid molecules of the instant invention. In a preferred embodiment, the polypeptide is a
lung specific polypeptide (LSP). In an even more preferred embodiment, the polypeptide
is derived from a polypeptide comprising the amino acid sequence of SEQ ID NO: 165
through 284. A polypeptide as defined herein may be produced recombinantly, as
discussed supra, may be isolated from a cell that naturally expresses the protein, or may
be chemically synthesized following the teachings of the specification and using methods
well-known to those having ordinary skill in the art.

In another aspect, the polypeptide may comprise a fragment of a polypeptide, 
wherein the fragment is as defined herein. In a preferred embodiment, the polypeptide 
fragment is a fragment of an LSP. In a more preferred embodiment, the fragment is 
derived from a polypeptide comprising the amino acid sequence of SEQ ID NO: 165
through 284. A polypeptide that comprises only a fragment of an entire LSP may or may
not be a polypeptide that is also an LSP. For instance, a full-length polypeptide may be 
lung-specific, while a fragment thereof may be found in other tissues as well as in lung. 
A polypeptide that is not an LSP, whether it is a fragment, analog, mutein, homologous 
protein or derivative, is nevertheless useful, especially for immunizing animals to prepare
anti-LSP antibodies. However, in a preferred embodiment, the part or fragment is an
LSP. Methods of determining whether a polypeptide is an LSP are described infra. 

Fragments of at least 6 contiguous amino acids are useful in mapping B cell and T
cell epitopes of the reference protein. See, e.g., Geysen et al., Proc. Natl. Acad. Sci. USA
81: 3998-4002 (1984) and U.S. Patents 4,708,871 and 5,595,915, the disclosures of
which are incorporated herein by reference in their entireties. Because the fragment need
not itself be immunogenic, part of an immunodominant epitope, nor even recognized by 
native antibody, to be useful in such epitope mapping, all fragments of at least 6 amino 
acids of the proteins of the present invention have utility in such a study. 
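
By way of illustration only, the set of contiguous fragments of a given length can be enumerated computationally before peptides are chosen for an epitope-mapping study. The following Python sketch lists every overlapping window of a chosen length. The function name `contiguous_fragments` and the 10-residue example sequence are hypothetical and are not taken from the sequence listing.

```python
def contiguous_fragments(sequence: str, length: int = 6) -> list[str]:
    """Return every contiguous fragment (overlapping window) of the given length.

    A sequence of N residues yields N - length + 1 fragments, or none if the
    sequence is shorter than the requested length.
    """
    if length < 1:
        raise ValueError("length must be a positive integer")
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


# Hypothetical sequence used only to show the windowing; not an LSP sequence.
example_sequence = "MKTLLVAGCS"
for fragment in contiguous_fragments(example_sequence, 6):
    print(fragment)
```

The same function can be called with a length of 8, 15 or any other value discussed herein to enumerate the longer fragments.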

Fragments of at least 8 contiguous amino acids, often at least 15 contiguous
amino acids, are useful as immunogens for raising antibodies that recognize the proteins
of the present invention. See, e.g., Lerner, Nature 299: 592-596 (1982); Shinnick et al.,
Annu. Rev. Microbiol. 37: 425-46 (1983); Sutcliffe et al., Science 219: 660-6 (1983), the
disclosures of which are incorporated herein by reference in their entireties. As further
described in the above-cited references, virtually all 8-mers, conjugated to a carrier, such
as a protein, prove immunogenic, meaning that they are capable of eliciting antibody for
the conjugated peptide; accordingly, all fragments of at least 8 amino acids of the
proteins of the present invention have utility as immunogens.

Fragments of at least 8, 9, 10 or 12 contiguous amino acids are also useful as
competitive inhibitors of binding of the entire protein, or a portion thereof, to antibodies
(as in epitope mapping), and to natural binding partners, such as subunits in a multimeric
complex or to receptors or ligands of the subject protein; this competitive inhibition
permits identification and separation of molecules that bind specifically to the protein of
interest. See U.S. Patents 5,539,084 and 5,783,674, incorporated herein by reference in their
entireties.

The protein, or protein fragment, of the present invention is thus at least 6 amino 
acids in length, typically at least 8, 9, 10 or 12 amino acids in length, and often at least 15 
amino acids in length. Often, the protein of the present invention, or fragment thereof, is
at least 20 amino acids in length, even 25 amino acids, 30 amino acids, 35 amino acids, 
or 50 amino acids or more in length. Of course, larger fragments having at least 75 
amino acids, 100 amino acids, or even 150 amino acids are also useful, and at times 
preferred. 

One having ordinary skill in the art can produce fragments of a polypeptide by
truncating the nucleic acid molecule, e.g., an LSNA, encoding the polypeptide and then
expressing it recombinantly. Alternatively, one can produce a fragment by chemically
synthesizing a portion of the full-length polypeptide. One may also produce a fragment
by enzymatically cleaving either a recombinant polypeptide or an isolated
naturally-occurring polypeptide. Methods of producing polypeptide fragments are well-known in
the art. See, e.g., Sambrook (1989), supra; Sambrook (2001), supra; Ausubel (1992),
supra; and Ausubel (1999), supra. In one embodiment, a polypeptide comprising only a
fragment of a polypeptide of the invention, preferably an LSP, may be produced by
chemical or enzymatic cleavage of a polypeptide. In a preferred embodiment, a
polypeptide fragment is produced by expressing a nucleic acid molecule encoding a
fragment of the polypeptide, preferably an LSP, in a host cell.

By "polypeptides" as used herein it is also meant to be inclusive of mutants,
fusion proteins, homologous proteins and allelic variants of the polypeptides specifically 
exemplified. 

A mutant protein, or mutein, may have the same or different properties compared
to a naturally-occurring polypeptide and comprises at least one amino acid insertion,
duplication, deletion, rearrangement or substitution compared to the amino acid sequence
of a native protein. Small deletions and insertions can often be found that do not alter the
function of the protein. In one embodiment, the mutein may or may not be lung-specific.
In a preferred embodiment, the mutein is lung-specific. In a preferred embodiment, the
mutein is a polypeptide that comprises at least one amino acid insertion, duplication,
deletion, rearrangement or substitution compared to the amino acid sequence of SEQ ID
NO: 165 through 284. In a more preferred embodiment, the mutein is one that exhibits at
least 50% sequence identity, more preferably at least 60% sequence identity, even more
preferably at least 70%, yet more preferably at least 80% sequence identity to an LSP
comprising an amino acid sequence of SEQ ID NO: 165 through 284. In yet a more
preferred embodiment, the mutein exhibits at least 85%, more preferably 90%, even more
preferably 95% or 96%, and yet more preferably at least 97%, 98%, 99% or 99.5%
sequence identity to an LSP comprising an amino acid sequence of SEQ ID NO: 165
through 284.
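
As a rough illustration of how the identity thresholds recited above might be checked, the following Python sketch computes percent identity over two sequences that have already been aligned, with gaps written as "-". This is a simplified assumption: it counts identical positions over all aligned columns, which is only one of several conventions, and it does not perform the alignment itself or reproduce the sequence comparison algorithms referred to elsewhere in this specification. The function names and example sequences are hypothetical.

```python
# Identity thresholds recited in the text, in ascending order.
THRESHOLDS = (50, 60, 70, 80, 85, 90, 95, 96, 97, 98, 99, 99.5)


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a gap opposite a
    residue counts as a mismatch (an assumed convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def highest_threshold_met(identity: float):
    """Return the highest recited threshold that the identity meets, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Hypothetical aligned pair differing at one of ten positions.
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(identity, highest_threshold_met(identity))
```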

A mutein may be produced by isolation from a naturally-occurring mutant cell,
tissue or organism. A mutein may be produced by isolation from a cell, tissue or
organism that has been experimentally mutagenized. Alternatively, a mutein may be
produced by chemical manipulation of a polypeptide, such as by altering an amino acid
residue to another amino acid residue using synthetic or semi-synthetic chemical
techniques. In a preferred embodiment, a mutein may be produced from a host cell
comprising an altered nucleic acid molecule compared to the naturally-occurring nucleic
acid molecule. For instance, one may produce a mutein of a polypeptide by introducing 
one or more mutations into a nucleic acid sequence of the invention and then expressing 
it recombinantly. These mutations may be targeted, in which particular encoded amino 
acids are altered, or may be untargeted, in which random encoded amino acids within the
polypeptide are altered. Muteins with random amino acid alterations can be screened for
a particular biological activity or property, particularly whether the polypeptide is
lung-specific, as described below. Multiple random mutations can be introduced into the
gene by methods well-known in the art, e.g., by error-prone PCR, shuffling,
oligonucleotide-directed mutagenesis, assembly PCR, sexual PCR mutagenesis, in vivo
mutagenesis, cassette mutagenesis, recursive ensemble mutagenesis, exponential
ensemble mutagenesis and site-specific mutagenesis. Methods of producing muteins
with targeted or random amino acid alterations are well-known in the art. See, e.g.,
Sambrook (1989), supra; Sambrook (2001), supra; Ausubel (1992), supra; Ausubel
(1999), supra; U.S. Patent 5,223,408; and the references discussed supra, each herein
incorporated by reference.
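
Untargeted mutation, as described above, can be modeled in silico when planning a screening library. The following Python sketch introduces a chosen number of random single-base substitutions into a nucleotide sequence. It is a simplified illustration under stated assumptions: it treats all positions and substitutions as equally likely and does not simulate the error spectrum of error-prone PCR or any other laboratory method. The function name, the seed parameter and the example sequence are hypothetical.

```python
import random


def random_point_mutations(dna: str, n_mutations: int, seed: int = 0) -> str:
    """Return a copy of dna with n_mutations random single-base substitutions.

    Positions are drawn without replacement, and each chosen base is replaced
    by a different base, so exactly n_mutations positions differ in the result.
    The input is assumed to contain only A, C, G and T.
    """
    rng = random.Random(seed)  # seeded so that a run can be repeated
    bases = list(dna.upper())
    for position in rng.sample(range(len(bases)), n_mutations):
        alternatives = [b for b in "ACGT" if b != bases[position]]
        bases[position] = rng.choice(alternatives)
    return "".join(bases)


# Hypothetical 15-nucleotide coding sequence, for illustration only.
original = "ATGGCCAAAGGTTGA"
mutated = random_point_mutations(original, 3, seed=1)
print(original)
print(mutated)
```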

By "polypeptide" as used herein it is also meant to be inclusive of polypeptides
homologous to those polypeptides exemplified herein. In a preferred embodiment, the
polypeptide is homologous to an LSP. In an even more preferred embodiment, the
polypeptide is homologous to an LSP selected from the group having an amino acid
sequence of SEQ ID NO: 165 through 284. In a preferred embodiment, the homologous
polypeptide is one that exhibits significant sequence identity to an LSP. In a more
preferred embodiment, the polypeptide is one that exhibits significant sequence identity
to an LSP comprising an amino acid sequence of SEQ ID NO: 165 through 284. In an even
more preferred embodiment, the homologous polypeptide is one that exhibits at least
50% sequence identity, more preferably at least 60% sequence identity, even more
preferably at least 70%, yet more preferably at least 80% sequence identity to an LSP
comprising an amino acid sequence of SEQ ID NO: 165 through 284. In a yet more
preferred embodiment, the homologous polypeptide is one that exhibits at least 85%,
more preferably 90%, even more preferably 95% or 96%, and yet more preferably at
least 97% or 98% sequence identity to an LSP comprising an amino acid sequence of
SEQ ID NO: 165 through 284. In another preferred embodiment, the homologous
polypeptide is one that exhibits at least 99%, more preferably 99.5%, even more
preferably 99.6%, 99.7%, 99.8% or 99.9% sequence identity to an LSP comprising an
amino acid sequence of SEQ ID NO: 165 through 284. In a preferred embodiment, the 
amino acid substitutions are conservative amino acid substitutions as discussed above. 

In another embodiment, the homologous polypeptide is one that is encoded by a
nucleic acid molecule that selectively hybridizes to an LSNA. In a preferred
embodiment, the homologous polypeptide is encoded by a nucleic acid molecule that
hybridizes to an LSNA under low stringency, moderate stringency or high stringency
conditions, as defined herein. In a more preferred embodiment, the LSNA is selected
from the group consisting of SEQ ID NO: 1 through 164. In another preferred
embodiment, the homologous polypeptide is encoded by a nucleic acid molecule that
hybridizes to a nucleic acid molecule that encodes an LSP under low stringency,
moderate stringency or high stringency conditions, as defined herein. In a more preferred
embodiment, the LSP is selected from the group consisting of SEQ ID NO: 165 through
284.

The homologous polypeptide may be a naturally-occurring one that is derived
from another species, especially one derived from another primate, such as chimpanzee,
gorilla, rhesus macaque or baboon, wherein the homologous polypeptide
comprises an amino acid sequence that exhibits significant sequence identity to that of
SEQ ID NO: 165 through 284. The homologous polypeptide may also be a
naturally-occurring polypeptide from a human, when the LSP is a member of a family of
polypeptides. The homologous polypeptide may also be a naturally-occurring
polypeptide derived from a non-primate, mammalian species, including without
limitation, domesticated species, e.g., dog, cat, mouse, rat, rabbit, guinea pig, hamster,
cow, horse, goat or pig. The homologous polypeptide may also be a naturally-occurring
polypeptide derived from a non-mammalian species, such as birds or reptiles. The
naturally-occurring homologous protein may be isolated directly from humans or other
species. Alternatively, the nucleic acid molecule encoding the naturally-occurring
homologous polypeptide may be isolated and used to express the homologous
polypeptide recombinantly. In another embodiment, the homologous polypeptide may be
one that is experimentally produced by random mutation of a nucleic acid molecule and
subsequent expression of the nucleic acid molecule. In another embodiment, the
homologous polypeptide may be one that is experimentally produced by directed
mutation of one or more codons to alter the encoded amino acid of an LSP. Further, the
homologous protein may or may not be a polypeptide that is an LSP. However, in a
preferred embodiment, the homologous polypeptide is a polypeptide that is an
LSP.

Relatedness of proteins can also be characterized using a second functional test,
the ability of a first protein competitively to inhibit the binding of a second protein to an
antibody. It is, therefore, another aspect of the present invention to provide isolated
proteins not only identical in sequence to those described with particularity herein, but
also to provide isolated proteins ("cross-reactive proteins") that competitively inhibit the
binding of antibodies to all or to a portion of various of the isolated polypeptides of the
present invention. Such competitive inhibition can readily be determined using 
immunoassays well-known in the art.

As discussed above, single nucleotide polymorphisms (SNPs) occur frequently in
eukaryotic genomes, and the sequence determined from one individual of a species may
differ from other allelic forms present within the population. Thus, by "polypeptide" as
used herein it is also meant to be inclusive of polypeptides encoded by an allelic variant
of a nucleic acid molecule encoding an LSP. In a preferred embodiment, the polypeptide
is encoded by an allelic variant of a gene that encodes a polypeptide having the amino
acid sequence selected from the group consisting of SEQ ID NO: 165 through 284. In a
yet more preferred embodiment, the polypeptide is encoded by an allelic variant of a
gene that has the nucleic acid sequence selected from the group consisting of SEQ ID
NO: 1 through 164.

In another embodiment, the invention provides polypeptides which comprise 
derivatives of a polypeptide encoded by a nucleic acid molecule according to the instant 
invention. In a preferred embodiment, the polypeptide is an LSP. In a preferred 
embodiment, the polypeptide has an amino acid sequence selected from the group
consisting of SEQ ID NO: 165 through 284, or is a mutein, allelic variant, homologous
protein or fragment thereof. In a preferred embodiment, the derivative has been
acetylated, carboxylated, phosphorylated, glycosylated or ubiquitinated. In another
preferred embodiment, the derivative has been labeled with, e.g., radioactive isotopes
such as ¹²⁵I, ³²P, ³⁵S, and ³H. In another preferred embodiment, the derivative has been
labeled with fluorophores, chemiluminescent agents, enzymes, and antiligands that can
serve as specific binding pair members for a labeled ligand. 

Polypeptide modifications are well-known to those of skill and have been 
described in great detail in the scientific literature. Several particularly common 
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of
glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are described
in most basic texts, such as, for instance Creighton, Protein Structure and Molecular
Properties, 2nd ed., W. H. Freeman and Company (1993). Many detailed reviews are
available on this subject, such as, for example, those provided by Wold, in Johnson (ed.),
Posttranslational Covalent Modification of Proteins, pgs. 1-12, Academic Press (1983);
Seifter et al., Meth. Enzymol. 182: 626-646 (1990) and Rattan et al., Ann. N.Y. Acad. Sci.
663: 48-62 (1992).

It will be appreciated, as is well-known and as noted above, that polypeptides are
not always entirely linear. For instance, polypeptides may be branched as a result of
ubiquitination, and they may be circular, with or without branching, generally as a result
of posttranslation events, including natural processing events and events brought about by
human manipulation which do not occur naturally. Circular, branched and branched
circular polypeptides may be synthesized by non-translation natural processes and by
entirely synthetic methods, as well. Modifications can occur anywhere in a polypeptide,
including the peptide backbone, the amino acid side-chains and the amino or carboxyl
termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or both, by a
covalent modification, is common in naturally occurring and synthetic polypeptides and
such modifications may be present in polypeptides of the present invention, as well. For
instance, the amino terminal residue of polypeptides made in E. coli, prior to proteolytic
processing, almost invariably will be N-formylmethionine.

Useful post-synthetic (and post-translational) modifications include conjugation
to detectable labels, such as fluorophores. A wide variety of amine-reactive and
thiol-reactive fluorophore derivatives have been synthesized that react under nondenaturing
conditions with N-terminal amino groups and epsilon amino groups of lysine residues, on 
the one hand, and with free thiol groups of cysteine residues, on the other. 

Kits are available commercially that permit conjugation of proteins to a variety of
amine-reactive or thiol-reactive fluorophores: Molecular Probes, Inc. (Eugene, OR,
USA), e.g., offers kits for conjugating proteins to Alexa Fluor 350, Alexa Fluor 430,
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, Alexa Fluor 532, Alexa Fluor 546,
Alexa Fluor 568, Alexa Fluor 594, and Texas Red-X.

A wide variety of other amine-reactive and thiol-reactive fluorophores are
available commercially (Molecular Probes, Inc., Eugene, OR, USA), including Alexa
Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor®
568, Alexa Fluor® 594, Alexa Fluor® 647 (monoclonal antibody labeling kits available
from Molecular Probes, Inc., Eugene, OR, USA), BODIPY dyes, such as BODIPY
493/503, BODIPY FL, BODIPY R6G, BODIPY 530/550, BODIPY TMR, BODIPY
558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 581/591,
BODIPY TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow,
Dansyl, lissamine rhodamine B, Marina Blue, Oregon Green 488, Oregon Green 514,
Pacific Blue, rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine,
Texas Red (available from Molecular Probes, Inc., Eugene, OR, USA).

The polypeptides of the present invention can also be conjugated to fluorophores,
other proteins, and other macromolecules, using bifunctional linking reagents. Common
homobifunctional reagents include, e.g., APG, AEDP, BASED, BMB, BMDB, BMH,
BMOE, BM[PEO]3, BM[PEO]4, BS3, BSOCOES, DFDNB, DMA, DMP, DMS,
DPDPB, DSG, DSP (Lomant's Reagent), DSS, DST, DTBP, DTME, DTSSP, EGS,
HBVS, Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS (all available from Pierce, Rockford,
IL, USA); common heterobifunctional cross-linkers include ABH, AMAS, ANB-NOS,
APDP, ASBA, BMPA, BMPH, BMPS, EDC, EMCA, EMCH, EMCS, KMUA, KMUH,
GMBS, LC-SMCC, LC-SPDP, MBS, M2C2H, MPBH, MSA, NHS-ASA, PDPH, PMPI,
SADP, SAED, SAND, SANPAH, SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC,
SMPB, SMPH, SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB, Sulfo-KMUS,
Sulfo-LC-SPDP, Sulfo-MBS, Sulfo-NHS-LC-ASA, Sulfo-SADP, Sulfo-SANPAH,
Sulfo-SIAB, Sulfo-SMCC, Sulfo-SMPB, Sulfo-LC-SMPT, SVSB, TFCS (all available
from Pierce, Rockford, IL, USA).

The polypeptides, fragments, and fusion proteins of the present invention can be 
conjugated, using such cross-linking reagents, to fluorophores that are not amine- or 
thiol-reactive. Other labels that usefully can be conjugated to the polypeptides,
fragments, and fusion proteins of the present invention include radioactive labels,
echosonographic contrast reagents, and MRI contrast agents. 

The polypeptides, fragments, and fusion proteins of the present invention can also 
usefully be conjugated using cross-linking agents to carrier proteins, such as KLH, 
bovine thyroglobulin, and even bovine serum albumin (BSA), to increase
immunogenicity for raising anti-LSP antibodies.

The polypeptides, fragments, and fusion proteins of the present invention can also
usefully be conjugated to polyethylene glycol (PEG); PEGylation increases the serum
half-life of proteins administered intravenously for replacement therapy. See Delgado et al.,
Crit. Rev. Ther. Drug Carrier Syst. 9(3-4): 249-304 (1992); Scott et al., Curr. Pharm.
Des. 4(6): 423-38 (1998); DeSantis et al., Curr. Opin. Biotechnol. 10(4): 324-30 (1999),
incorporated herein by reference in their entireties. PEG monomers can be attached to
the protein directly or through a linker, with PEGylation using PEG monomers activated
with tresyl chloride (2,2,2-trifluoroethanesulphonyl chloride) permitting direct
attachment under mild conditions.

In yet another embodiment, the invention provides analogs of a polypeptide
encoded by a nucleic acid molecule according to the instant invention. In a preferred
embodiment, the polypeptide is an LSP. In a more preferred embodiment, the analog is
derived from a polypeptide having part or all of the amino acid sequence of SEQ ID NO:
165 through 284. In a preferred embodiment, the analog is one that comprises one or
more substitutions of non-natural amino acids or non-native inter-residue bonds
compared to the naturally-occurring polypeptide. In general, the non-peptide analog is
structurally similar to an LSP, but one or more peptide linkages is replaced by a linkage
selected from the group consisting of --CH2NH--, --CH2S--, --CH2-CH2--,
--CH=CH-- (cis and trans), --COCH2--, --CH(OH)CH2-- and --CH2SO--. In another
embodiment, the non-peptide analog comprises substitution of one or more amino acids
of an LSP with a D-amino acid of the same type or other non-natural amino acid in order
to generate more stable peptides. D-amino acids can readily be incorporated during
chemical peptide synthesis: peptides assembled from D-amino acids are more resistant to
proteolytic attack; incorporation of D-amino acids can also be used to confer specific
three-dimensional conformations on the peptide. Other amino acid analogues commonly
added during chemical synthesis include ornithine, norleucine, phosphorylated amino
acids (typically phosphoserine, phosphothreonine, phosphotyrosine), L-malonyltyrosine,
a non-hydrolyzable analog of phosphotyrosine (see, e.g., Kole et al., Biochem. Biophys.
Res. Com. 209: 817-821 (1995)), and various halogenated phenylalanine derivatives.

Non-natural amino acids can be incorporated during solid phase chemical
synthesis or by recombinant techniques, although the former is typically more common.
Solid phase chemical synthesis of peptides is well established in the art. Procedures are
described, inter alia, in Chan et al. (eds.), Fmoc Solid Phase Peptide Synthesis: A
Practical Approach (Practical Approach Series), Oxford Univ. Press (March 2000);
Jones, Amino Acid and Peptide Synthesis (Oxford Chemistry Primers, No 7), Oxford
Univ. Press (1992); and Bodanszky, Principles of Peptide Synthesis (Springer
Laboratory), Springer Verlag (1993); the disclosures of which are incorporated herein by
reference in their entireties.

Amino acid analogues having detectable labels are also usefully incorporated
during synthesis to provide derivatives and analogs. Biotin, for example, can be added
using biotinoyl-(9-fluorenylmethoxycarbonyl)-L-lysine (FMOC biocytin) (Molecular
Probes, Eugene, OR, USA). Biotin can also be added enzymatically by incorporation
into a fusion protein of an E. coli BirA substrate peptide. The FMOC and tBOC
derivatives of dabcyl-L-lysine (Molecular Probes, Inc., Eugene, OR, USA) can be used to
incorporate the dabcyl chromophore at selected sites in the peptide sequence during
synthesis. The aminonaphthalene derivative EDANS, the most common fluorophore for
pairing with the dabcyl quencher in fluorescence resonance energy transfer (FRET)
systems, can be introduced during automated synthesis of peptides by using
EDANS-FMOC-L-glutamic acid or the corresponding tBOC derivative (both from
Molecular Probes, Inc., Eugene, OR, USA). Tetramethylrhodamine fluorophores can be
incorporated during automated FMOC synthesis of peptides using
(FMOC)-TMR-L-lysine (Molecular Probes, Inc., Eugene, OR, USA).

Other useful amino acid analogues that can be incorporated during chemical
synthesis include aspartic acid, glutamic acid, lysine, and tyrosine analogues having allyl
side-chain protection (Applied Biosystems, Inc., Foster City, CA, USA); the allyl side 
chain permits synthesis of cyclic, branched-chain, sulfonated, glycosylated, and 
phosphorylated peptides. 

A large number of other FMOC-protected non-natural amino acid analogues
capable of incorporation during chemical synthesis are available commercially,
including, e.g., Fmoc-2-aminobicyclo[2.2.1]heptane-2-carboxylic acid,
Fmoc-3-endo-aminobicyclo[2.2.1]heptane-2-endo-carboxylic acid,
Fmoc-3-exo-aminobicyclo[2.2.1]heptane-2-exo-carboxylic acid,
Fmoc-3-endo-amino-bicyclo[2.2.1]hept-5-ene-2-endo-carboxylic acid,
Fmoc-3-exo-amino-bicyclo[2.2.1]hept-5-ene-2-exo-carboxylic acid,
Fmoc-cis-2-amino-1-cyclohexanecarboxylic acid,
Fmoc-trans-2-amino-1-cyclohexanecarboxylic acid, Fmoc-1-amino-1-cyclopentanecarboxylic
acid, Fmoc-cis-2-amino-1-cyclopentanecarboxylic acid,
Fmoc-1-amino-1-cyclopropanecarboxylic acid, Fmoc-D-2-amino-4-(ethylthio)butyric acid,
Fmoc-L-2-amino-4-(ethylthio)butyric acid, Fmoc-L-buthionine, Fmoc-S-methyl-L-Cysteine,
Fmoc-2-aminobenzoic acid (anthranilic acid), Fmoc-3-aminobenzoic acid,
Fmoc-4-aminobenzoic acid, Fmoc-2-aminobenzophenone-2'-carboxylic acid,
Fmoc-N-(4-aminobenzoyl)-β-alanine, Fmoc-2-amino-4,5-dimethoxybenzoic acid,
Fmoc-4-aminohippuric acid, Fmoc-2-amino-3-hydroxybenzoic acid,
Fmoc-2-amino-5-hydroxybenzoic acid, Fmoc-3-amino-4-hydroxybenzoic acid,
Fmoc-4-amino-3-hydroxybenzoic acid, Fmoc-4-amino-2-hydroxybenzoic acid,
Fmoc-5-amino-2-hydroxybenzoic acid, Fmoc-2-amino-3-methoxybenzoic acid,
Fmoc-4-amino-3-methoxybenzoic acid, Fmoc-2-amino-3-methylbenzoic acid,
Fmoc-2-amino-5-methylbenzoic acid, Fmoc-2-amino-6-methylbenzoic acid,
Fmoc-3-amino-2-methylbenzoic acid, Fmoc-3-amino-4-methylbenzoic acid,
Fmoc-4-amino-3-methylbenzoic acid, Fmoc-3-amino-2-naphthoic acid,
Fmoc-D,L-3-amino-3-phenylpropionic acid, Fmoc-L-Methyldopa,
Fmoc-2-amino-4,6-dimethyl-3-pyridinecarboxylic acid,
Fmoc-D,L-amino-2-thiophenacetic acid, Fmoc-4-(carboxymethyl)piperazine,
Fmoc-4-carboxypiperazine, Fmoc-4-(carboxymethyl)homopiperazine,
Fmoc-4-phenyl-4-piperidinecarboxylic acid,
Fmoc-L-1,2,3,4-tetrahydronorharman-3-carboxylic acid, Fmoc-L-thiazolidine-4-carboxylic acid,
all available from The Peptide Laboratory (Richmond, CA, USA).

Non-natural residues can also be added biosynthetically by engineering a 
suppressor tRNA, typically one that recognizes the UAG stop codon, by chemical 
aminoacylation with the desired unnatural amino acid. Conventional site-directed 
mutagenesis is used to introduce the chosen stop codon UAG at the site of interest in the
protein gene. When the acylated suppressor tRNA and the mutant gene are combined in
an in vitro transcription/translation system, the unnatural amino acid is incorporated in
response to the UAG codon to give a protein containing that amino acid at the specified
position. Liu et al., Proc. Natl. Acad. Sci. USA 96(9): 4780-5 (1999); Wang et al.,
Science 292(5516): 498-500 (2001).
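
The sequence-design step of the foregoing approach, placing a UAG codon at the site of interest, can be sketched in a few lines of Python. The sketch below operates on the coding strand written as DNA, so the amber codon appears as TAG. It illustrates only the codon replacement and does not model primer design or the mutagenesis reaction. The function name and the example coding sequence are hypothetical.

```python
def introduce_amber_codon(coding_dna: str, residue_number: int) -> str:
    """Replace the codon for the given 1-based residue with TAG.

    coding_dna is assumed to be an in-frame coding strand whose length is a
    multiple of three; TAG in DNA corresponds to the UAG stop codon in mRNA.
    """
    sequence = coding_dna.upper()
    if len(sequence) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of three")
    start = 3 * (residue_number - 1)
    if not 0 <= start < len(sequence):
        raise ValueError("residue_number is outside the coding sequence")
    return sequence[:start] + "TAG" + sequence[start + 3:]


# Hypothetical four-codon open reading frame: Met-Ala-Lys-stop.
print(introduce_amber_codon("ATGGCCAAATGA", 2))
```

In this hypothetical example the alanine codon GCC at residue 2 is replaced, giving the site at which the suppressor tRNA would deliver the non-natural residue.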

Fusion Proteins

The present invention further provides fusions of each of the polypeptides and 
fragments of the present invention to heterologous polypeptides. In a preferred 
embodiment, the polypeptide is an LSP. In a more preferred embodiment, the 
polypeptide that is fused to the heterologous polypeptide comprises part or all of the
amino acid sequence of SEQ ID NO: 165 through 284, or is a mutein, homologous
polypeptide, analog or derivative thereof. In an even more preferred embodiment, the
nucleic acid molecule encoding the fusion protein comprises all or part of the nucleic
acid sequence of SEQ ID NO: 1 through 164, or comprises all or part of a nucleic acid
sequence that selectively hybridizes or is homologous to a nucleic acid molecule 
comprising a nucleic acid sequence of SEQ ID NO: 1 through 164. 

The fusion proteins of the present invention will include at least one fragment of
the protein of the present invention, which fragment is at least 6, typically at least 8, often
at least 15, and usefully at least 16, 17, 18, 19, or 20 amino acids long. The fragment of
the protein of the present invention to be included in the fusion can usefully be at least 25
amino acids long, at least 50 amino acids long, and can be at least 75, 100, or even 150 amino
acids long. Fusions that include the entirety of the proteins of the present invention have
particular utility.

The heterologous polypeptide included within the fusion protein of the present 
invention is at least 6 amino acids in length, often at least 8 amino acids in length, and 
usefully at least 15, 20, and 25 amino acids in length. Fusions that include larger 
polypeptides, such as the IgG Fc region, and even entire proteins (such as GFP
chromophore-containing proteins) are particularly useful.

As described above in the description of vectors and expression vectors of the 
present invention, which discussion is incorporated here by reference in its entirety, 
heterologous polypeptides to be included in the fusion proteins of the present invention 
can usefully include those designed to facilitate purification and/or visualization of
recombinantly-expressed proteins. See, e.g., Ausubel, Chapter 16, (1992), supra.
Although purification tags can also be incorporated into fusions that are chemically 
synthesized, chemical synthesis typically provides sufficient purity that further 
purification by HPLC suffices; however, visualization tags as above described retain 
their utility even when the protein is produced by chemical synthesis, and when so 

25 included render the fusion proteins of the present invention useful as directly detectable 
markers of the presence of a polypeptide of the invention. 

As also discussed above, heterologous polypeptides to be included in the fusion
proteins of the present invention can usefully include those that facilitate secretion of
recombinantly expressed proteins (into the periplasmic space or extracellular milieu for
prokaryotic hosts, into the culture medium for eukaryotic cells) through incorporation
of secretion signals and/or leader sequences. For example, a His6-tagged protein can be
purified on a Ni affinity column and a GST fusion protein can be purified on a
glutathione affinity column. Similarly, a fusion protein comprising the Fc domain of IgG
can be purified on a Protein A or Protein G column and a fusion protein comprising an
epitope tag such as myc can be purified using an immunoaffinity column containing an
anti-c-myc antibody. It is preferable that the epitope tag be separated from the protein
encoded by the essential gene by an enzymatic cleavage site that can be cleaved after
purification. See also the discussion of nucleic acid molecules encoding fusion proteins
that may be expressed on the surface of a cell.

Other useful protein fusions of the present invention include those that permit use
of the protein of the present invention as bait in a yeast two-hybrid system. See Bartel et
al. (eds.), The Yeast Two-Hybrid System, Oxford University Press (1997); Zhu et al.,
Yeast Hybrid Technologies, Eaton Publishing (2000); Fields et al., Trends Genet. 10(8):
286-92 (1994); Mendelsohn et al., Curr. Opin. Biotechnol. 5(5): 482-6 (1994); Luban et
al., Curr. Opin. Biotechnol. 6(1): 59-64 (1995); Allen et al., Trends Biochem. Sci.
20(12): 511-6 (1995); Drees, Curr. Opin. Chem. Biol. 3(1): 64-70 (1999); Topcu et al.,
Pharm. Res. 17(9): 1049-55 (2000); Fashena et al., Gene 250(1-2): 1-14 (2000); Colas
et al., (1996) Genetic selection of peptide aptamers that recognize and inhibit cyclin-dependent
kinase 2. Nature 380, 548-550; Norman, T. et al., (1999) Genetic selection of
peptide inhibitors of biological pathways. Science 285, 591-595; Fabbrizio et al., (1999)
Inhibition of mammalian cell proliferation by genetically selected peptide aptamers that
functionally antagonize E2F activity. Oncogene 18, 4357-4363; Xu et al., (1997) Cells
that register logical relationships among proteins. Proc Natl Acad Sci USA 94,
12473-12478; Yang et al., (1995) Protein-peptide interactions analyzed with the yeast
two-hybrid system. Nuc. Acids Res. 23, 1152-1156; Kolonin et al., (1998) Targeting
cyclin-dependent kinases in Drosophila with peptide aptamers. Proc Natl Acad Sci USA 95,
14266-14271; Cohen et al., (1998) An artificial cell-cycle inhibitor isolated from a
combinatorial library. Proc Natl Acad Sci USA 95, 14272-14277; Uetz, P.; Giot, L.; et al.;
Fields, S.; Rothberg, J. M. (2000) A comprehensive analysis of protein-protein
interactions in Saccharomyces cerevisiae. Nature 403, 623-627; Ito et al., (2001) A
comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc Natl
Acad Sci USA 98, 4569-4574, the disclosures of which are incorporated herein by
reference in their entireties. Typically, such fusion is to either E. coli LexA or yeast
GAL4 DNA binding domains. Related bait plasmids are available that express the bait
fused to a nuclear localization signal.

Other useful fusion proteins include those that permit display of the encoded
protein on the surface of a phage or cell, fusions to intrinsically fluorescent proteins, such
as green fluorescent protein (GFP), and fusions to the IgG Fc region, as described above,
which discussion is incorporated here by reference in its entirety.

The polypeptides and fragments of the present invention can also usefully be
fused to protein toxins, such as Pseudomonas exotoxin A, diphtheria toxin, shiga toxin
A, anthrax toxin lethal factor, and ricin, in order to effect ablation of cells that bind or take up
the proteins of the present invention.

Fusion partners include, inter alia, myc, hemagglutinin (HA), GST,
immunoglobulins, β-galactosidase, biotin, trpE, protein A, β-lactamase, α-amylase,
maltose binding protein, alcohol dehydrogenase, polyhistidine (for example, six histidines
at the amino and/or carboxyl terminus of the polypeptide), lacZ, green fluorescent protein
(GFP), yeast mating factor, GAL4 transcription activation or DNA binding domain,
luciferase, and serum proteins such as ovalbumin, albumin and the constant domain of
IgG. See, e.g., Ausubel (1992), supra and Ausubel (1999), supra. Fusion proteins may
also contain sites for specific enzymatic cleavage, such as a site that is recognized by
enzymes such as Factor XIII, trypsin, pepsin, or any other enzyme known in the art.
Fusion proteins will typically be made by recombinant nucleic acid methods, as
described above, chemically synthesized using techniques well-known in the art (e.g., a
Merrifield synthesis), or produced by chemical cross-linking.

Another advantage of fusion proteins is that the epitope tag can be used to bind
the fusion protein to a plate or column through an affinity linkage for screening binding
proteins or other molecules that bind to the LSP.

As further described below, the isolated polypeptides, muteins, fusion proteins,
homologous proteins or allelic variants of the present invention can readily be used as
specific immunogens to raise antibodies that specifically recognize LSPs, their allelic
variants and homologues. The antibodies, in turn, can be used, inter alia, specifically to
assay for the polypeptides of the present invention, particularly LSPs, e.g. by ELISA for
detection of protein in fluid samples, such as serum, by immunohistochemistry or laser
scanning cytometry, for detection of protein in tissue samples, or by flow cytometry, for
detection of intracellular protein in cell suspensions, for specific antibody-mediated
isolation and/or purification of LSPs, as for example by immunoprecipitation, and for use
as specific agonists or antagonists of LSPs.

One may determine whether polypeptides including muteins, fusion proteins,
homologous proteins or allelic variants are functional by methods known in the art. For
instance, residues that are tolerant of change while retaining function can be identified by
altering the protein at known residues using methods known in the art, such as alanine
scanning mutagenesis, Cunningham et al., Science 244(4908): 1081-5 (1989); transposon
linker scanning mutagenesis, Chen et al., Gene 263(1-2): 39-48 (2001); combinations of
homolog- and alanine-scanning mutagenesis, Jin et al., J. Mol. Biol. 226(3): 851-65
(1992); combinatorial alanine scanning, Weiss et al., Proc. Natl. Acad. Sci. USA 97(16):
8950-4 (2000), followed by functional assay. Transposon linker scanning kits are
available commercially (New England Biolabs, Beverly, MA, USA, catalog no. E7-102S;
EZ::TN™ In-Frame Linker Insertion Kit, catalogue no. EZI04KN, Epicentre
Technologies Corporation, Madison, WI, USA).
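
As a minimal in-silico sketch of the alanine-scanning idea described above, the following Python enumerates the single-residue alanine substitutions of a peptide. The function name `alanine_scan` and the example sequence are hypothetical illustrations and are not taken from the sequence listing; the functional assay that would follow each substitution is not modeled.

```python
def alanine_scan(sequence):
    """Return (position, original_residue, variant) for each single Ala substitution.

    Positions are 1-based. Residues that are already alanine are skipped,
    since substituting them would reproduce the parent sequence.
    """
    variants = []
    for i, residue in enumerate(sequence):
        if residue == "A":
            continue
        variant = sequence[:i] + "A" + sequence[i + 1:]
        variants.append((i + 1, residue, variant))
    return variants


# Hypothetical 6-residue peptide used only for illustration.
for pos, res, var in alanine_scan("MKAYLE"):
    print(f"{res}{pos}A: {var}")
```

Each variant in such a list would then be expressed and tested in a functional assay to identify residues that tolerate change.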

Purification of the polypeptides including fragments, homologous polypeptides,
muteins, analogs, derivatives and fusion proteins is well-known and within the skill of
one having ordinary skill in the art. See, e.g., Scopes, Protein Purification, 2d ed. (1987).
Purification of recombinantly expressed polypeptides is described above. Purification of
chemically-synthesized peptides can readily be effected, e.g., by HPLC.

Accordingly, it is an aspect of the present invention to provide the isolated
proteins of the present invention in pure or substantially pure form in the presence or
absence of a stabilizing agent. Stabilizing agents include both proteinaceous and
non-proteinaceous material and are well-known in the art. Stabilizing agents, such as albumin
and polyethylene glycol (PEG), are known and are commercially available.

Although high levels of purity are preferred when the isolated proteins of the
present invention are used as therapeutic agents, such as in vaccines and as replacement
therapy, the isolated proteins of the present invention are also useful at lower purity. For
example, partially purified proteins of the present invention can be used as immunogens
to raise antibodies in laboratory animals.

In preferred embodiments, the purified and substantially purified proteins of the
present invention are in compositions that lack detectable ampholytes, acrylamide
monomers, bis-acrylamide monomers, and polyacrylamide.

The polypeptides, fragments, analogs, derivatives and fusions of the present
invention can usefully be attached to a substrate. The substrate can be porous or solid,
planar or non-planar; the bond can be covalent or noncovalent.

For example, the polypeptides, fragments, analogs, derivatives and fusions of the
present invention can usefully be bound to a porous substrate, commonly a membrane,
typically comprising nitrocellulose, polyvinylidene fluoride (PVDF), or cationically
derivatized, hydrophilic PVDF; so bound, the proteins, fragments, and fusions of the
present invention can be used to detect and quantify antibodies, e.g. in serum, that bind
specifically to the immobilized protein of the present invention.

As another example, the polypeptides, fragments, analogs, derivatives and fusions
of the present invention can usefully be bound to a substantially nonporous substrate,
such as plastic, to detect and quantify antibodies, e.g. in serum, that bind specifically to
the immobilized protein of the present invention. Such plastics include
polymethylacrylic, polyethylene, polypropylene, polyacrylate, polymethylmethacrylate,
polyvinylchloride, polytetrafluoroethylene, polystyrene, polycarbonate, polyacetal,
polysulfone, cellulose acetate, cellulose nitrate, nitrocellulose, or mixtures thereof; when
the assay is performed in a standard microtiter dish, the plastic is typically polystyrene.

The polypeptides, fragments, analogs, derivatives and fusions of the present
invention can also be attached to a substrate suitable for use as a surface enhanced laser
desorption ionization source; so attached, the protein, fragment, or fusion of the present
invention is useful for binding and then detecting secondary proteins that bind with
sufficient affinity or avidity to the surface-bound protein to indicate biologic interaction
therebetween. The proteins, fragments, and fusions of the present invention can also be
attached to a substrate suitable for use in surface plasmon resonance detection; so
attached, the protein, fragment, or fusion of the present invention is useful for binding
and then detecting secondary proteins that bind with sufficient affinity or avidity to the
surface-bound protein to indicate biological interaction therebetween.

Antibodies

In another aspect, the invention provides antibodies, including fragments and
derivatives thereof, that bind specifically to polypeptides encoded by the nucleic acid
molecules of the invention, as well as antibodies that bind to fragments, muteins,
derivatives and analogs of the polypeptides. In a preferred embodiment, the antibodies
are specific for a polypeptide that is an LSP, or a fragment, mutein, derivative, analog or
fusion protein thereof. In a more preferred embodiment, the antibodies are specific for a
polypeptide that comprises SEQ ID NO: 165 through 284, or a fragment, mutein,
derivative, analog or fusion protein thereof.

The antibodies of the present invention can be specific for linear epitopes,
discontinuous epitopes, or conformational epitopes of such proteins or protein fragments,
either as present on the protein in its native conformation or, in some cases, as present on
the proteins as denatured, as, e.g., by solubilization in SDS. New epitopes may also
arise from differences in post-translational modifications (PTMs) in disease versus normal
tissue. For example, a particular site on an LSP may be glycosylated in cancerous cells,
but not glycosylated in normal cells, or vice versa. In addition, alternative splice forms
of an LSP may be indicative of cancer. Differential degradation of the C- or N-terminus
of an LSP may also be a marker or target for anticancer therapy. For example, an LSP may
be N-terminally degraded in cancer cells, exposing new epitopes to which antibodies may
selectively bind for diagnostic or therapeutic uses.

As is well-known in the art, the degree to which an antibody can discriminate as
among molecular species in a mixture will depend, in part, upon the conformational
relatedness of the species in the mixture; typically, the antibodies of the present invention
will discriminate over adventitious binding to non-LSP polypeptides by at least 2-fold,
more typically by at least 5-fold, typically by more than 10-fold, 25-fold, 50-fold,
75-fold, and often by more than 100-fold, and on occasion by more than 500-fold or
1000-fold. When used to detect the proteins or protein fragments of the present invention, the
antibody of the present invention is sufficiently specific when it can be used to determine
the presence of the protein of the present invention in samples derived from human lung.

Typically, the affinity or avidity of an antibody (or antibody multimer, as in the
case of an IgM pentamer) of the present invention for a protein or protein fragment of the
present invention will be at least about 1 x 10⁻⁶ molar (M), typically at least about 5 x 10⁻⁷
M, 1 x 10⁻⁷ M, with affinities and avidities of at least 1 x 10⁻⁸ M, 5 x 10⁻⁹ M, 1 x 10⁻¹⁰ M
and up to 1 x 10⁻¹³ M proving especially useful.
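
The affinity figures above read most naturally as equilibrium dissociation constants (K_D), where a smaller value means tighter binding. That reading is an assumption here, as is the simple one-site equilibrium model used in the sketch below, under which the fraction of antigen bound at a free antibody concentration [Ab] is [Ab] / ([Ab] + K_D). The function names and the 10 nM antibody concentration are illustrative only.

```python
def fraction_bound(antibody_conc_molar, kd_molar):
    """Fraction of antigen occupied under a one-site equilibrium binding model."""
    if antibody_conc_molar < 0 or kd_molar <= 0:
        raise ValueError("concentration must be >= 0 and K_D must be > 0")
    return antibody_conc_molar / (antibody_conc_molar + kd_molar)


def meets_affinity_floor(kd_molar, floor_molar=1e-6):
    """True if K_D is at or below the floor, i.e. binding is at least that tight."""
    return kd_molar <= floor_molar


# Compare K_D values named in the text at an assumed 10 nM free antibody.
for kd in (1e-6, 1e-7, 1e-8, 1e-10, 1e-13):
    print(f"K_D = {kd:.0e} M -> fraction bound = {fraction_bound(1e-8, kd):.4f}")
```

When the free antibody concentration equals K_D, half of the antigen is bound, which is why tighter binders reach high occupancy at lower antibody concentrations.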

The antibodies of the present invention can be naturally-occurring forms, such as
IgG, IgM, IgD, IgE, IgY, and IgA, from any avian, reptilian, or mammalian species.

Human antibodies can, but will infrequently, be drawn directly from human
donors or human cells. In this case, antibodies to the proteins of the present invention
will typically have resulted from fortuitous immunization, such as autoimmune
immunization, with the protein or protein fragments of the present invention. Such
antibodies will typically, but will not invariably, be polyclonal. In addition, individual
polyclonal antibodies may be isolated and cloned to generate monoclonals.

Human antibodies are more frequently obtained using transgenic animals that
express human immunoglobulin genes, which transgenic animals can be affirmatively
immunized with the protein immunogen of the present invention. Human Ig-transgenic
mice capable of producing human antibodies and methods of producing human
antibodies therefrom upon specific immunization are described, inter alia, in U.S. Patents
6,162,963; 6,150,584; 6,114,598; 6,075,181; 5,939,598; 5,877,397; 5,874,299;
5,814,318; 5,789,650; 5,770,429; 5,661,016; 5,633,425; 5,625,126; 5,569,825;
5,545,807; 5,545,806, and 5,591,669, the disclosures of which are incorporated herein by
reference in their entireties. Such antibodies are typically monoclonal, and are typically
produced using techniques developed for production of murine antibodies.

Human antibodies are particularly useful, and often preferred, when the
antibodies of the present invention are to be administered to human beings as in vivo
diagnostic or therapeutic agents, since recipient immune response to the administered
antibody will often be substantially less than that occasioned by administration of an
antibody derived from another species, such as mouse.

IgG, IgM, IgD, IgE, IgY, and IgA antibodies of the present invention can also be
obtained from other species, including mammals such as rodents (typically mouse, but
also rat, guinea pig, and hamster), lagomorphs (typically rabbits), and also larger
mammals, such as sheep, goats, cows, and horses, and egg-laying birds or reptiles
such as chickens or alligators. For example, avian antibodies may be generated using
techniques described in WO 00/29444, published 25 May 2000, the contents of which are
hereby incorporated in their entirety. In such cases, as with the transgenic
human-antibody-producing non-human mammals, fortuitous immunization is not required, and
the non-human mammal is typically affirmatively immunized, according to standard
immunization protocols, with the protein or protein fragment of the present invention.

As discussed above, virtually all fragments of 8 or more contiguous amino acids
of the proteins of the present invention can be used effectively as immunogens when
conjugated to a carrier, typically a protein such as bovine thyroglobulin, keyhole limpet
hemocyanin, or bovine serum albumin, conveniently using a bifunctional linker such as
those described elsewhere above, which discussion is incorporated by reference here.
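
To make the set of candidate immunogen fragments concrete, the following Python sketch enumerates every contiguous fragment of at least 8 residues from a sequence. The function name `contiguous_fragments` and the 10-residue example sequence are hypothetical and are not drawn from the sequence listing; conjugation to a carrier and immunogenicity itself are not modeled.

```python
def contiguous_fragments(sequence, min_length=8):
    """Yield (start, fragment) for every contiguous fragment of at least min_length residues.

    start is a 1-based position in the parent sequence.
    """
    n = len(sequence)
    for length in range(min_length, n + 1):
        for start in range(0, n - length + 1):
            yield start + 1, sequence[start:start + length]


# Hypothetical 10-residue sequence, for illustration only.
fragments = list(contiguous_fragments("MKTAYLQEHG"))
print(len(fragments))
for start, frag in fragments:
    print(start, frag)
```

A sequence shorter than the minimum length yields no fragments, so callers can pass any sequence without a separate length check.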

Immunogenicity can also be conferred by fusion of the polypeptide and fragments
of the present invention to other moieties. For example, peptides of the present invention
can be produced by solid phase synthesis on a branched polylysine core matrix; these
multiple antigenic peptides (MAPs) provide high purity, increased avidity, accurate
chemical definition and improved safety in vaccine development. Tam et al., Proc. Natl.
Acad. Sci. USA 85: 5409-5413 (1988); Posnett et al., J. Biol. Chem. 263: 1719-1725
(1988).

Protocols for immunizing non-human mammals or avian species are
well-established in the art. See Harlow et al. (eds.), Using Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory (1998); Coligan et al. (eds.), Current Protocols in
Immunology, John Wiley & Sons, Inc. (2001); Zola, Monoclonal Antibodies: Preparation
and Use of Monoclonal Antibodies and Engineered Antibody Derivatives (Basics: From
Background to Bench), Springer Verlag (2000); Gross M, Speck J., Dtsch. Tierarztl.
Wochenschr. 103: 417-422 (1996), the disclosures of which are incorporated herein by
reference. Immunization protocols often include multiple immunizations, either with or
without adjuvants such as Freund's complete adjuvant and Freund's incomplete adjuvant,
and may include naked DNA immunization (Moss, Semin. Immunol. 2: 317-327 (1990)).

Antibodies from non-human mammals and avian species can be polyclonal or
monoclonal, with polyclonal antibodies having certain advantages in
immunohistochemical detection of the proteins of the present invention and monoclonal
antibodies having advantages in identifying and distinguishing particular epitopes of the
proteins of the present invention. Antibodies from avian species may have particular
advantage in detection of the proteins of the present invention in human serum or tissues
(Vikinge et al., Biosens. Bioelectron. 13: 1257-1262 (1998)).

Following immunization, the antibodies of the present invention can be produced
using any art-accepted technique. Such techniques are well-known in the art, Coligan,
supra; Zola, supra; Howard et al. (eds.), Basic Methods in Antibody Production and
Characterization, CRC Press (2000); Harlow, supra; Davis (ed.), Monoclonal Antibody
Protocols, Vol. 45, Humana Press (1995); Delves (ed.), Antibody Production: Essential
Techniques, John Wiley & Son Ltd (1997); Kenney, Antibody Solution: An Antibody
Methods Manual, Chapman & Hall (1997), incorporated herein by reference in their
entireties, and thus need not be detailed here.

Briefly, however, such techniques include, inter alia, production of monoclonal
antibodies by hybridomas and expression of antibodies or fragments or derivatives
thereof from host cells engineered to express immunoglobulin genes or fragments
thereof. These two methods of production are not mutually exclusive: genes encoding
antibodies specific for the proteins or protein fragments of the present invention can be
cloned from hybridomas and thereafter expressed in other host cells. Nor need the two
necessarily be performed together: e.g., genes encoding antibodies specific for the
proteins and protein fragments of the present invention can be cloned directly from B
cells known to be specific for the desired protein, as further described in U.S. Patent
5,627,052, the disclosure of which is incorporated herein by reference in its entirety, or
from antibody-displaying phage.

Recombinant expression in host cells is particularly useful when fragments or
derivatives of the antibodies of the present invention are desired.

Host cells for recombinant production of either whole antibodies, antibody 
fragments, or antibody derivatives can be prokaryotic or eukaryotic. 

Prokaryotic hosts are particularly useful for producing phage displayed antibodies
of the present invention.

The technology of phage-displayed antibodies, in which antibody variable region
fragments are fused, for example, to the gene III protein (pIII) or gene VIII protein
(pVIII) for display on the surface of filamentous phage, such as M13, is by now
well-established. See, e.g., Sidhu, Curr. Opin. Biotechnol. 11(6): 610-6 (2000); Griffiths et
al., Curr. Opin. Biotechnol. 9(1): 102-8 (1998); Hoogenboom et al., Immunotechnology
4(1): 1-20 (1998); Rader et al., Current Opinion in Biotechnology 8: 503-508 (1997);
Aujame et al., Human Antibodies 8: 155-168 (1997); Hoogenboom, Trends in
Biotechnol. 15: 62-70 (1997); de Kruif et al., 17: 453-455 (1996); Barbas et al., Trends in
Biotechnol. 14: 230-234 (1996); Winter et al., Ann. Rev. Immunol. 433-455 (1994).
Techniques and protocols required to generate, propagate, screen (pan), and use the
antibody fragments from such libraries have recently been compiled. See, e.g., Barbas
(2001), supra; Kay, supra; Abelson, supra, the disclosures of which are incorporated
herein by reference in their entireties.

Typically, phage-displayed antibody fragments are scFv fragments or Fab
fragments; when desired, full length antibodies can be produced by cloning the variable
regions from the displaying phage into a complete antibody and expressing the full length
antibody in a further prokaryotic or a eukaryotic host cell.

Eukaryotic cells are also useful for expression of the antibodies, antibody 
fragments, and antibody derivatives of the present invention. 

For example, antibody fragments of the present invention can be produced in
Pichia pastoris and in Saccharomyces cerevisiae. See, e.g., Takahashi et al., Biosci.
Biotechnol. Biochem. 64(10): 2138-44 (2000); Freyre et al., J. Biotechnol. 76(2-3):
157-63 (2000); Fischer et al., Biotechnol. Appl. Biochem. 30 (Pt 2): 117-20 (1999);
Pennell et al., Res. Immunol. 149(6): 599-603 (1998); Eldin et al., J. Immunol. Methods
201(1): 67-75 (1997); Frenken et al., Res. Immunol. 149(6): 589-99 (1998); Shusta et al.,
Nature Biotechnol. 16(8): 773-7 (1998), the disclosures of which are incorporated herein
by reference in their entireties.

Antibodies, including antibody fragments and derivatives, of the present
invention can also be produced in insect cells. See, e.g., Li et al., Protein Expr. Purif.
21(1): 121-8 (2001); Ailor et al., Biotechnol. Bioeng. 58(2-3): 196-203 (1998); Hsu et al.,
Biotechnol. Prog. 13(1): 96-104 (1997); Edelman et al., Immunology 91(1): 13-9 (1997);
and Nesbit et al., J. Immunol. Methods 151(1-2): 201-8 (1992), the disclosures of which
are incorporated herein by reference in their entireties.

Antibodies and fragments and derivatives thereof of the present invention can
also be produced in plant cells, particularly maize or tobacco, Giddings et al., Nature
Biotechnol. 18(11): 1151-5 (2000); Gavilondo et al., Biotechniques 29(1): 128-38 (2000);
Fischer et al., J. Biol. Regul. Homeost. Agents 14(2): 83-92 (2000); Fischer et al.,
Biotechnol. Appl. Biochem. 30 (Pt 2): 113-6 (1999); Fischer et al., Biol. Chem. 380(7-8):
825-39 (1999); Russell, Curr. Top. Microbiol. Immunol. 240: 119-38 (1999); and Ma et
al., Plant Physiol. 109(2): 341-6 (1995), the disclosures of which are incorporated herein
by reference in their entireties.

Antibodies, including antibody fragments and derivatives, of the present
invention can also be produced in transgenic, non-human, mammalian milk. See, e.g.,
Pollock et al., J. Immunol. Methods 231: 147-57 (1999); Young et al., Res. Immunol.
149: 609-10 (1998); Limonta et al., Immunotechnology 1: 107-13 (1995), the disclosures
of which are incorporated herein by reference in their entireties.

Mammalian cells useful for recombinant expression of antibodies, antibody
fragments, and antibody derivatives of the present invention include CHO cells, COS
cells, 293 cells, and myeloma cells.

Verma et al., J. Immunol. Methods 216(1-2): 165-81 (1998), herein incorporated
by reference, review and compare bacterial, yeast, insect and mammalian expression
systems for expression of antibodies.

Antibodies of the present invention can also be prepared by cell free translation,
as further described in Merk et al., J. Biochem. (Tokyo) 125(2): 328-33 (1999) and
Ryabova et al., Nature Biotechnol. 15(1): 79-84 (1997), and in the milk of transgenic
animals, as further described in Pollock et al., J. Immunol. Methods 231(1-2): 147-57
(1999), the disclosures of which are incorporated herein by reference in their entireties.

The invention further provides antibody fragments that bind specifically to one or
more of the proteins and protein fragments of the present invention, to one or more of the
proteins and protein fragments encoded by the isolated nucleic acids of the present
invention, or the binding of which can be competitively inhibited by one or more of the
proteins and protein fragments of the present invention or one or more of the proteins and
protein fragments encoded by the isolated nucleic acids of the present invention.

Among such useful fragments are Fab, Fab', Fv, F(ab')2, and single chain Fv
(scFv) fragments. Other useful fragments are described in Hudson, Curr. Opin.
Biotechnol. 9(4): 395-402 (1998).

It is also an aspect of the present invention to provide antibody derivatives that
bind specifically to one or more of the proteins and protein fragments of the present
invention, to one or more of the proteins and protein fragments encoded by the isolated
nucleic acids of the present invention, or the binding of which can be competitively
inhibited by one or more of the proteins and protein fragments of the present invention or
one or more of the proteins and protein fragments encoded by the isolated nucleic acids
of the present invention. 

Among such useful derivatives are chimeric, primatized, and humanized
antibodies; such derivatives are less immunogenic in human beings, and thus more
suitable for in vivo administration, than are unmodified antibodies from non-human
mammalian species. Another useful derivatization is PEGylation, which increases the serum
half-life of the antibodies.

Chimeric antibodies typically include heavy and/or light chain variable regions
(including both CDR and framework residues) of immunoglobulins of one species,
typically mouse, fused to constant regions of another species, typically human. See, e.g.,
United States Patent No. 5,807,715; Morrison et al., Proc. Natl. Acad. Sci. USA 81(21):
6851-5 (1984); Sharon et al., Nature 309(5966): 364-7 (1984); Takeda et al., Nature
314(6010): 452-4 (1985), the disclosures of which are incorporated herein by reference in
their entireties. Primatized and humanized antibodies typically include heavy and/or
light chain CDRs from a murine antibody grafted into a non-human primate or human
antibody V region framework, usually further comprising a human constant region,
Riechmann et al., Nature 332(6162): 323-7 (1988); Co et al., Nature 351(6326): 501-2
(1991); United States Patent Nos. 6,054,297; 5,821,337; 5,770,196; 5,766,886;
5,821,123; 5,869,619; 6,180,377; 6,013,256; 5,693,761; and 6,180,370, the disclosures of
which are incorporated herein by reference in their entireties.

Other useful antibody derivatives of the invention include heteromeric antibody 
complexes and antibody fusions, such as diabodies (bispecific antibodies), single-chain 
diabodies, and intrabodies. 

It is contemplated that the nucleic acids encoding the antibodies of the present
invention can be operably joined to other nucleic acids forming a recombinant vector for
cloning or for expression of the antibodies of the invention. The present invention
includes any recombinant vector containing the coding sequences, or part thereof,
whether for eukaryotic transduction, transfection or gene therapy. Such vectors may be
prepared using conventional molecular biology techniques, known to those with skill in
the art, and would comprise DNA encoding sequences for the immunoglobulin V-regions
including framework and CDRs or parts thereof, and a suitable promoter either with or
without a signal sequence for intracellular transport. Such vectors may be transduced or
transfected into eukaryotic cells or used for gene therapy (Marasco et al., Proc. Natl.
Acad. Sci. (USA) 90: 7889-7893 (1993); Duan et al., Proc. Natl. Acad. Sci. (USA) 91:
5075-5079 (1994)), by conventional techniques, known to those with skill in the art.

The antibodies of the present invention, including fragments and derivatives
thereof, can usefully be labeled. It is, therefore, another aspect of the present invention to
provide labeled antibodies that bind specifically to one or more of the proteins and
protein fragments of the present invention, to one or more of the proteins and protein
fragments encoded by the isolated nucleic acids of the present invention, or the binding
of which can be competitively inhibited by one or more of the proteins and protein
fragments of the present invention or one or more of the proteins and protein fragments
encoded by the isolated nucleic acids of the present invention.

The choice of label depends, in part, upon the desired use.

For example, when the antibodies of the present invention are used for
immunohistochemical staining of tissue samples, the label is preferably an enzyme that
catalyzes production and local deposition of a detectable product.

Enzymes typically conjugated to antibodies to permit their immunohistochemical
visualization are well-known, and include alkaline phosphatase, β-galactosidase, glucose
oxidase, horseradish peroxidase (HRP), and urease. Typical substrates for production
and deposition of visually detectable products include o-nitrophenyl-beta-D-galactopyranoside
(ONPG); o-phenylenediamine dihydrochloride (OPD); p-nitrophenyl
phosphate (PNPP); p-nitrophenyl-beta-D-galactopyranoside (PNPG); 3,3'-diaminobenzidine
(DAB); 3-amino-9-ethylcarbazole (AEC); 4-chloro-1-naphthol (CN);
5-bromo-4-chloro-3-indolyl-phosphate (BCIP); ABTS®; BluoGal; iodonitrotetrazolium
(INT); nitroblue tetrazolium chloride (NBT); phenazine methosulfate (PMS);
phenolphthalein monophosphate (PMP); tetramethyl benzidine (TMB); tetranitroblue
tetrazolium (TNBT); X-Gal; X-Gluc; and X-Glucoside.

Other substrates can be used to produce products for local deposition that are
luminescent. For example, in the presence of hydrogen peroxide (H2O2), horseradish
peroxidase (HRP) can catalyze the oxidation of cyclic diacylhydrazides, such as luminol.
Immediately following the oxidation, the luminol is in an excited state (intermediate
reaction product), which decays to the ground state by emitting light. Strong
enhancement of the light emission is produced by enhancers, such as phenolic
compounds. Advantages include high sensitivity, high resolution, and rapid detection
without radioactivity and requiring only small amounts of antibody. See, e.g., Thorpe et
al., Methods Enzymol. 133: 331-53 (1986); Kricka et al., J. Immunoassay 17(1): 67-83
(1996); and Lundqvist et al., J. Biolumin. Chemilumin. 10(6): 353-9 (1995), the
disclosures of which are incorporated herein by reference in their entireties. Kits for such
enhanced chemiluminescent detection (ECL) are available commercially.

The antibodies can also be labeled using colloidal gold. 

As another example, when the antibodies of the present invention are used, e.g.,
for flow cytometric detection, for scanning laser cytometric detection, or for fluorescent
immunoassay, they can usefully be labeled with fluorophores.

There are a wide variety of fluorophore labels that can usefully be attached to the 
antibodies of the present invention. 

For flow cytometric applications, both for extracellular detection and for
intracellular detection, common useful fluorophores can be fluorescein isothiocyanate
(FITC), allophycocyanin (APC), R-phycoerythrin (PE), peridinin chlorophyll protein
(PerCP), Texas Red, Cy3, Cy5, and fluorescence resonance energy tandem fluorophores such
as PerCP-Cy5.5, PE-Cy5, PE-Cy5.5, PE-Cy7, PE-Texas Red, and APC-Cy7.

Other fluorophores include, inter alia, Alexa Fluor® 350, Alexa Fluor® 488,
Alexa Fluor® 532, Alexa Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, Alexa
Fluor® 647 (monoclonal antibody labeling kits available from Molecular Probes, Inc.,
Eugene, OR, USA), BODIPY dyes, such as BODIPY 493/503, BODIPY FL, BODIPY
R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568,
BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY TR, BODIPY
630/650, BODIPY 650/665, Cascade Blue, Cascade Yellow, Dansyl, lissamine
rhodamine B, Marina Blue, Oregon Green 488, Oregon Green 514, Pacific Blue,
rhodamine 6G, rhodamine green, rhodamine red, tetramethylrhodamine, Texas Red
(available from Molecular Probes, Inc., Eugene, OR, USA), and Cy2, Cy3, Cy3.5, Cy5,
Cy5.5, Cy7, all of which are also useful for fluorescently labeling the antibodies of the
present invention.

For secondary detection using labeled avidin, streptavidin, captavidin or
neutravidin, the antibodies of the present invention can usefully be labeled with biotin.

When the antibodies of the present invention are used, e.g., for Western blotting
applications, they can usefully be labeled with radioisotopes, such as ³³P, ³²P, ³⁵S, ³H,
and ¹²⁵I.

As another example, when the antibodies of the present invention are used for
radioimmunotherapy, the label can usefully be ²²⁸Th, ²²⁷Ac, ²²⁵Ac, ²²³Ra, ²¹³Bi, ²¹²Pb,
²¹²Bi, ²¹¹At, ²⁰³Pb, ¹⁹⁴Os, ¹⁸⁸Re, ¹⁸⁶Re, ¹⁵³Sm, ¹⁴⁹Tb, ¹³¹I, ¹²⁵I, ¹¹¹In, ¹⁰⁵Rh, ⁹⁹ᵐTc, ⁹⁷Ru,
⁹⁰Y, ⁹⁰Sr, ⁸⁸Y, ⁷²Se, ⁶⁷Cu, or ⁴⁷Sc.

As another example, when the antibodies of the present invention are to be used
for in vivo diagnostic use, they can be rendered detectable by conjugation to MRI
contrast agents, such as gadolinium diethylenetriaminepentaacetic acid (DTPA), Lauffer
et al., Radiology 207(2): 529-38 (1998), or by radioisotopic labeling.

As would be understood, use of the labels described above is not restricted to the 
application for which they are mentioned. 

The antibodies of the present invention, including fragments and derivatives
thereof, can also be conjugated to toxins, in order to target the toxin's ablative action to
cells that display and/or express the proteins of the present invention. Commonly, the
antibody in such immunotoxins is conjugated to Pseudomonas exotoxin A, diphtheria
toxin, shiga toxin A, anthrax toxin lethal factor, or ricin. See Hall (ed.), Immunotoxin
Methods and Protocols (Methods in Molecular Biology, vol. 166), Humana Press (2000);
and Frankel et al. (eds.), Clinical Applications of Immunotoxins, Springer-Verlag (1998),
the disclosures of which are incorporated herein by reference in their entireties.

The antibodies of the present invention can usefully be attached to a substrate,
and it is, therefore, another aspect of the invention to provide antibodies that bind
specifically to one or more of the proteins and protein fragments of the present invention,
to one or more of the proteins and protein fragments encoded by the isolated nucleic
acids of the present invention, or the binding of which can be competitively inhibited by
one or more of the proteins and protein fragments of the present invention or one or more
of the proteins and protein fragments encoded by the isolated nucleic acids of the present
invention, attached to a substrate.

Substrates can be porous or nonporous, planar or nonplanar.

For example, the antibodies of the present invention can usefully be conjugated to
filtration media, such as NHS-activated Sepharose or CNBr-activated Sepharose for
purposes of immunoaffinity chromatography.

For example, the antibodies of the present invention can usefully be attached to
paramagnetic microspheres, typically by biotin-streptavidin interaction, which
microspheres can then be used for isolation of cells that express or display the proteins of
the present invention. As another example, the antibodies of the present invention can
usefully be attached to the surface of a microtiter plate for ELISA.

As noted above, the antibodies of the present invention can be produced in
prokaryotic and eukaryotic cells. It is, therefore, another aspect of the present invention
to provide cells that express the antibodies of the present invention, including hybridoma
cells, B cells, plasma cells, and host cells recombinantly modified to express the
antibodies of the present invention.

In yet a further aspect, the present invention provides aptamers evolved to bind
specifically to one or more of the proteins and protein fragments of the present invention,
to one or more of the proteins and protein fragments encoded by the isolated nucleic
acids of the present invention, or the binding of which can be competitively inhibited by
one or more of the proteins and protein fragments of the present invention or one or more
of the proteins and protein fragments encoded by the isolated nucleic acids of the present
invention.

In sum, one of skill in the art, provided with the teachings of this invention, has
available a variety of methods which may be used to alter the biological properties of the
antibodies of this invention, including methods which would increase or decrease the
stability or half-life, immunogenicity, toxicity, affinity or yield of a given antibody
molecule, or to alter it in any other way that may render it more suitable for a particular
application.

Transgenic Animals and Cells 

In another aspect, the invention provides transgenic cells and non-human
organisms comprising nucleic acid molecules of the invention. In a preferred
embodiment, the transgenic cells and non-human organisms comprise a nucleic acid
molecule encoding an LSP. In a preferred embodiment, the LSP comprises an amino
acid sequence selected from SEQ ID NO: 165 through 284, or a fragment, mutein,
homologous protein or allelic variant thereof. In another preferred embodiment, the
transgenic cells and non-human organisms comprise an LSNA of the invention, preferably
an LSNA comprising a nucleotide sequence selected from the group consisting of SEQ
ID NO: 1 through 164, or a part, substantially similar nucleic acid molecule, allelic
variant or hybridizing nucleic acid molecule thereof.

In another embodiment, the transgenic cells and non-human organisms have a
targeted disruption or replacement of the endogenous orthologue of the human LSG. The
transgenic cells can be embryonic stem cells or somatic cells. The transgenic non-human
organisms can be chimeric, nonchimeric heterozygotes, and nonchimeric homozygotes.
Methods of producing transgenic animals are well-known in the art. See, e.g., Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, 2d ed., Cold Spring Harbor
Press (1999); Jackson et al., Mouse Genetics and Transgenics: A Practical Approach,
Oxford University Press (2000); and Pinkert, Transgenic Animal Technology: A
Laboratory Handbook, Academic Press (1999).

Any technique known in the art may be used to introduce a nucleic acid molecule
of the invention into an animal to produce the founder lines of transgenic animals. Such
techniques include, but are not limited to, pronuclear microinjection (see, e.g., Paterson
et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al., Biotechnology 11:
1263-1270 (1993); Wright et al., Biotechnology 9: 830-834 (1991); and U.S. Patent
4,873,191 (1989)); retrovirus-mediated gene transfer into germ lines, blastocysts or
embryos (see, e.g., Van der Putten et al., Proc. Natl. Acad. Sci. USA 82: 6148-6152
(1985)); gene targeting in embryonic stem cells (see, e.g., Thompson et al., Cell 56:
313-321 (1989)); electroporation of cells or embryos (see, e.g., Lo, Mol. Cell. Biol.
3: 1803-1814 (1983)); introduction using a gene gun (see, e.g., Ulmer et al., Science 259:
1745-49 (1993)); introducing nucleic acid constructs into embryonic pluripotent stem
cells and transferring the stem cells back into the blastocyst; and sperm-mediated gene
transfer (see, e.g., Lavitrano et al., Cell 57: 717-723 (1989)).

Other techniques include, for example, nuclear transfer into enucleated oocytes of
nuclei from cultured embryonic, fetal, or adult cells induced to quiescence (see, e.g.,
Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).
The present invention provides for transgenic animals that carry the transgene (i.e., a
nucleic acid molecule of the invention) in all their cells, as well as animals which carry
the transgene in some, but not all their cells, i.e., mosaic animals or chimeric animals.

The transgene may be integrated as a single transgene or as multiple copies, such
as in concatamers, e.g., head-to-head tandems or head-to-tail tandems. The transgene
may also be selectively introduced into and activated in a particular cell type by
following, e.g., the teaching of Lasko et al., Proc. Natl. Acad. Sci. USA 89: 6232-6236
(1992). The regulatory sequences required for such a cell-type specific activation
will depend upon the particular cell type of interest, and will be apparent to those of skill
in the art.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be
accomplished by Southern blot analysis or PCR techniques to analyze animal tissues to
verify that integration of the transgene has taken place. The level of mRNA expression
of the transgene in the tissues of the transgenic animals may also be assessed using
techniques which include, but are not limited to, Northern blot analysis of tissue samples
obtained from the animal, in situ hybridization analysis, and reverse transcriptase-PCR
(RT-PCR). Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels
because of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.
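
By way of illustration only, the crossing of heterozygous transgenic animals described
above can be sketched as a simple enumeration of allele combinations. The sketch assumes
ordinary Mendelian segregation at a single integration site; the allele labels "Tg" and
"+" and the function name cross are hypothetical and are not part of the disclosure.

```python
from collections import Counter
from itertools import product


def cross(parent_a, parent_b):
    """Count the equally likely offspring genotypes at one integration site.

    Each parent is a pair of alleles, written "Tg" (transgene present) or
    "+" (transgene absent). Assumes simple Mendelian segregation.
    """
    offspring = Counter()
    for allele_a, allele_b in product(parent_a, parent_b):
        # Sort so that ("Tg", "+") and ("+", "Tg") count as one genotype.
        genotype = tuple(sorted((allele_a, allele_b)))
        offspring[genotype] += 1
    return offspring


HETEROZYGOTE = ("Tg", "+")
result = cross(HETEROZYGOTE, HETEROZYGOTE)
for genotype, count in sorted(result.items()):
    print("/".join(genotype), count)
```

Under these assumptions, one of the four equally likely allele combinations is homozygous
for the integration site, which is why such a cross can be used to establish a homozygous
line.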

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant
expression, and in screening for compounds effective in ameliorating such conditions
and/or disorders.

Methods for creating a transgenic animal with a disruption of a targeted gene are
also well-known in the art. In general, a vector is designed to comprise some nucleotide
sequences homologous to the endogenous targeted gene. The vector is introduced into a
cell so that it may integrate, via homologous recombination with chromosomal
sequences, into the endogenous gene, thereby disrupting the function of the endogenous
gene. The transgene may also be selectively introduced into a particular cell type, thus
inactivating the endogenous gene in only that cell type. See, e.g., Gu et al., Science 265:
103-106 (1994). The regulatory sequences required for such a cell-type specific
inactivation will depend upon the particular cell type of interest, and will be apparent to
those of skill in the art. See, e.g., Smithies et al., Nature 317: 230-234 (1985); Thomas et
al., Cell 51: 503-512 (1987); Thompson et al., Cell 56: 313-321 (1989).

In one embodiment, a mutant, non-functional nucleic acid molecule of the
invention (or a completely unrelated DNA sequence) flanked by DNA homologous to the
endogenous nucleic acid sequence (either the coding regions or regulatory regions of the
gene) can be used, with or without a selectable marker and/or a negative selectable
marker, to transfect cells that express polypeptides of the invention in vivo. In another
embodiment, techniques known in the art are used to generate knockouts in cells that
contain, but do not express, the gene of interest. Insertion of the DNA construct, via
targeted homologous recombination, results in inactivation of the targeted gene. Such
approaches are particularly suited in research and agricultural fields where modifications
to embryonic stem cells can be used to generate animal offspring with an inactive
targeted gene. See, e.g., Thomas, supra and Thompson, supra. However, this approach
can be routinely adapted for use in humans provided the recombinant DNA constructs are
directly administered or targeted to the required site in vivo using appropriate viral
vectors that will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from an animal or patient or an MHC
compatible donor and can include, but are not limited to, fibroblasts, bone marrow cells,
blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial cells, etc. The cells
are genetically engineered in vitro using recombinant DNA techniques to introduce the
coding sequence of polypeptides of the invention into the cells, or alternatively, to disrupt
the coding sequence and/or endogenous regulatory sequence associated with the
polypeptides of the invention, e.g., by transduction (using viral vectors, and preferably
vectors that integrate the transgene into the cell genome) or transfection procedures,
including, but not limited to, the use of plasmids, cosmids, YACs, naked DNA,
electroporation, liposomes, etc.

The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. See, e.g., U.S. Patents 5,399,349 and 5,460,959, each of which is
incorporated by reference herein in its entirety.

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well-known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange
of components with the immediate extracellular environment, does not allow the
introduced cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include,
but are not limited to, animal model systems useful in elaborating the biological function
of polypeptides of the present invention, studying conditions and/or disorders associated
with aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

Computer Readable Means

A further aspect of the invention relates to a computer readable means for storing
the nucleic acid and amino acid sequences of the instant invention. In a preferred
embodiment, the invention provides a computer readable means for storing SEQ ID NO:
1 through 164 and SEQ ID NO: 165 through 284 as described herein, as the complete set
of sequences or in any combination. The records of the computer readable means can be
accessed for reading and display and for interface with a computer system for the
application of programs allowing for the location of data upon a query for data meeting
certain criteria, the comparison of sequences, the alignment or ordering of sequences
meeting a set of criteria, and the like.
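
As a minimal, non-limiting sketch of how stored records might be located upon a query
for data meeting certain criteria, the following assumes a simple in-memory record
layout. The class SequenceRecord, the function query, and the residue strings shown are
hypothetical placeholders chosen for illustration; they are not the sequences of the
invention and do not describe any particular storage format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SequenceRecord:
    seq_id: int      # SEQ ID NO of the record
    kind: str        # "nucleic" or "amino"
    residues: str    # placeholder residues, invented for illustration


def query(records, kind=None, min_length=0, motif=None):
    """Return the records meeting every supplied criterion, ordered by SEQ ID NO."""
    hits = [
        record for record in records
        if (kind is None or record.kind == kind)
        and len(record.residues) >= min_length
        and (motif is None or motif in record.residues)
    ]
    return sorted(hits, key=lambda record: record.seq_id)


# Placeholder records only; real records would hold the listed sequences.
RECORDS = [
    SequenceRecord(2, "nucleic", "ATGGCCTTAGGC"),
    SequenceRecord(1, "nucleic", "ATGAAATGA"),
    SequenceRecord(165, "amino", "MKTAYIAK"),
]

nucleic_hits = query(RECORDS, kind="nucleic", min_length=10)
print([record.seq_id for record in nucleic_hits])
```

The same query function orders the matching records, which corresponds to the
"alignment or ordering of sequences meeting a set of criteria" mentioned above only in
the loosest sense; a production system would likely use a database index instead.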

The nucleic acid and amino acid sequences of the invention are particularly useful
as components in databases useful for search analyses as well as in sequence analysis
algorithms. As used herein, the terms "nucleic acid sequences of the invention" and
"amino acid sequences of the invention" mean any detectable chemical or physical
characteristic of a polynucleotide or polypeptide of the invention that is or may be
reduced to or stored in a computer readable form. These include, without limitation,
chromatographic scan data or peak data, photographic data or scan data therefrom, and
mass spectrographic data.

This invention provides computer readable media having stored thereon
sequences of the invention. A computer readable medium may comprise one or more of
the following: a nucleic acid sequence comprising a sequence of a nucleic acid sequence
of the invention; an amino acid sequence comprising an amino acid sequence of the
invention; a set of nucleic acid sequences wherein at least one of said sequences
comprises the sequence of a nucleic acid sequence of the invention; a set of amino acid
sequences wherein at least one of said sequences comprises the sequence of an amino
acid sequence of the invention; a data set representing a nucleic acid sequence
comprising the sequence of one or more nucleic acid sequences of the invention; a data
set representing a nucleic acid sequence encoding an amino acid sequence comprising the
sequence of an amino acid sequence of the invention; a set of nucleic acid sequences
wherein at least one of said sequences comprises the sequence of a nucleic acid sequence
of the invention; a set of amino acid sequences wherein at least one of said sequences
comprises the sequence of an amino acid sequence of the invention; a data set
representing a nucleic acid sequence comprising the sequence of a nucleic acid sequence
of the invention; a data set representing a nucleic acid sequence encoding an amino acid
sequence comprising the sequence of an amino acid sequence of the invention. The
computer readable medium can be any composition of matter used to store information or
data, including, for example, commercially available floppy disks, tapes, hard drives,
compact disks, and video disks.

Also provided by the invention are methods for the analysis of character
sequences, particularly genetic sequences. Preferred methods of sequence analysis
include, for example, methods of sequence homology analysis, such as identity and
similarity analysis, RNA structure analysis, sequence assembly, cladistic analysis,
sequence motif analysis, open reading frame determination, nucleic acid base calling, and
sequencing chromatogram peak analysis.
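
One of the listed analyses, open reading frame determination, can be sketched as
follows. This is an illustrative simplification and not a prescribed method: it assumes
a DNA string, the standard ATG start codon and TAA/TAG/TGA stop codons, and scans only
the three forward reading frames. The function name find_orfs and the parameter
min_codons are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=2):
    """Return (start, end, sequence) for ORFs in the three forward frames.

    An ORF here runs from the first ATG in a frame through the next in-frame
    stop codon, inclusive. `end` is exclusive, so dna[start:end] is the ORF.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i + 3 - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, dna[start:i + 3]))
                start = None  # resume scanning for the next ATG
    return sorted(orfs)


print(find_orfs("CCATGAAATAGGG"))
```

A fuller implementation would also scan the reverse complement and might report ORFs
that run off the end of a partial sequence; both are omitted here for brevity.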

A computer-based method is provided for performing nucleic acid sequence
identity or similarity identification. This method comprises the steps of providing a
nucleic acid sequence comprising the sequence of a nucleic acid of the invention in a
computer readable medium; and comparing said nucleic acid sequence to at least one
nucleic acid or amino acid sequence to identify sequence identity or similarity.
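
The comparing step can be illustrated, under simplifying assumptions, by an ungapped
comparison that slides a query along a subject sequence and reports the best percent
identity. This sketch is not the alignment algorithm of any particular program; gapped
alignment and scoring matrices are deliberately left out, and the function names
percent_identity and best_ungapped_match are hypothetical.

```python
def percent_identity(a, b):
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def best_ungapped_match(query, subject):
    """Slide `query` along `subject`; return (offset, percent identity) of the best window."""
    if not query or len(query) > len(subject):
        raise ValueError("query must be non-empty and no longer than subject")
    best_offset, best_score = 0, -1.0
    for offset in range(len(subject) - len(query) + 1):
        window = subject[offset:offset + len(query)]
        score = percent_identity(query, window)
        if score > best_score:  # keep the first window with the highest score
            best_offset, best_score = offset, score
    return best_offset, best_score


print(percent_identity("ACGT", "ACGA"))
print(best_ungapped_match("GATT", "CCGATTACA"))
```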

A computer-based method is also provided for performing amino acid homology
identification, said method comprising the steps of: providing an amino acid sequence
comprising the sequence of an amino acid of the invention in a computer readable
medium; and comparing said amino acid sequence to at least one nucleic acid or
amino acid sequence to identify homology.

A computer-based method is still further provided for assembly of overlapping
nucleic acid sequences into a single nucleic acid sequence, said method comprising the
steps of: providing a first nucleic acid sequence comprising the sequence of a nucleic
acid of the invention in a computer readable medium; and screening for at least one
overlapping region between said first nucleic acid sequence and a second nucleic acid
sequence.
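
The screening step can be sketched, for illustration only, as a search for the longest
exact suffix of the first sequence that matches a prefix of the second, followed by a
merge. Exact matching and the default min_overlap of 3 are assumptions made for brevity;
practical assemblers tolerate mismatches and consider both strands. The function names
are hypothetical.

```python
def longest_overlap(first, second, min_overlap=3):
    """Length of the longest suffix of `first` that equals a prefix of `second`.

    Returns 0 when no exact overlap of at least `min_overlap` residues exists.
    """
    if min_overlap < 1:
        raise ValueError("min_overlap must be at least 1")
    max_len = min(len(first), len(second))
    for size in range(max_len, min_overlap - 1, -1):
        if first[-size:] == second[:size]:
            return size
    return 0


def merge(first, second, min_overlap=3):
    """Join two overlapping sequences into one, or return None if they do not overlap."""
    size = longest_overlap(first, second, min_overlap)
    if size == 0:
        return None
    return first + second[size:]


print(longest_overlap("ATGCCGTA", "CGTAGGC"))
print(merge("ATGCCGTA", "CGTAGGC"))
```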

Diagnostic Methods for Lung Cancer 

The present invention also relates to quantitative and qualitative diagnostic assays
and methods for detecting, diagnosing, monitoring, staging and predicting cancers by
comparing expression of an LSNA or an LSP in a human patient that has or may have
lung cancer, or who is at risk of developing lung cancer, with the expression of an LSNA
or an LSP in a normal human control. For purposes of the present invention, "expression
of an LSNA" or "LSNA expression" means the quantity of LSG mRNA that can be
measured by any method known in the art or the level of transcription that can be
measured by any method known in the art in a cell, tissue, organ or whole patient.
Similarly, the term "expression of an LSP" or "LSP expression" means the amount of
LSP that can be measured by any method known in the art or the level of translation of
an LSG LSNA that can be measured by any method known in the art.

The present invention provides methods for diagnosing lung cancer in a patient,
in particular squamous cell carcinoma, by analyzing for changes in levels of LSNA or
LSP in cells, tissues, organs or bodily fluids compared with levels of LSNA or LSP in
cells, tissues, organs or bodily fluids of preferably the same type from a normal human
control, wherein an increase, or decrease in certain cases, in levels of an LSNA or LSP in
the patient versus the normal human control is associated with the presence of lung
cancer or with a predilection to the disease. In another preferred embodiment, the
present invention provides methods for diagnosing lung cancer in a patient by analyzing
changes in the structure of the mRNA of an LSG compared to the mRNA from a normal
control. These changes include, without limitation, aberrant splicing, alterations in
polyadenylation and/or alterations in 5' nucleotide capping. In yet another preferred
embodiment, the present invention provides methods for diagnosing lung cancer in a
patient by analyzing changes in an LSP compared to an LSP from a normal control.
These changes include, e.g., alterations in glycosylation and/or phosphorylation of the
LSP or subcellular LSP localization.

In a preferred embodiment, the expression of an LSNA is measured by
determining the amount of an mRNA that encodes an amino acid sequence selected from
SEQ ID NO: 165 through 284, a homolog, an allelic variant, or a fragment thereof. In a
more preferred embodiment, the LSNA expression that is measured is the level of
expression of an LSNA mRNA selected from SEQ ID NO: 1 through 164, or a
hybridizing nucleic acid, homologous nucleic acid or allelic variant thereof, or a part of
any of these nucleic acids. LSNA expression may be measured by any method known in
the art, such as those described supra, including measuring mRNA expression by
Northern blot, quantitative or qualitative reverse transcriptase PCR (RT-PCR),
microarray, dot or slot blots or in situ hybridization. See, e.g., Ausubel (1992), supra;
Ausubel (1999), supra; Sambrook (1989), supra; and Sambrook (2001), supra. LSNA
transcription may be measured by any method known in the art, including using a reporter
gene operably linked to the promoter of an LSG of interest or performing nuclear run-off
assays. Alterations in mRNA structure, e.g., aberrant splicing variants, may be determined by
any method known in the art, including RT-PCR followed by sequencing or restriction
analysis. As necessary, LSNA expression may be compared to a known control, such as
normal lung nucleic acid, to detect a change in expression.
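
The comparison of a measured expression level to a known control can be sketched as a
simple ratio test. The two-fold cutoff used below is purely an assumption for
illustration; the specification does not fix any numerical threshold, and the function
name classify_expression is hypothetical. Both levels are assumed to be expressed in the
same units and normalized in the same way.

```python
def classify_expression(patient_level, control_level, fold_threshold=2.0):
    """Compare a patient expression level with a normal control.

    Returns (ratio, label), where label is "increased", "decreased" or
    "unchanged". `fold_threshold` is an illustrative cutoff, not a value
    taken from the specification.
    """
    if patient_level < 0 or control_level <= 0:
        raise ValueError("levels must be non-negative and the control must be positive")
    if fold_threshold <= 1:
        raise ValueError("fold_threshold must be greater than 1")
    ratio = patient_level / control_level
    if ratio >= fold_threshold:
        return ratio, "increased"
    if ratio <= 1 / fold_threshold:
        return ratio, "decreased"
    return ratio, "unchanged"


print(classify_expression(8.0, 2.0))
print(classify_expression(1.0, 4.0))
print(classify_expression(3.0, 2.0))
```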

In another preferred embodiment, the expression of an LSP is measured by
determining the level of an LSP having an amino acid sequence selected from the group
consisting of SEQ ID NO: 165 through 284, a homolog, an allelic variant, or a fragment
thereof. Such levels are preferably determined in at least one of cells, tissues, organs
and/or bodily fluids, including determination of normal and abnormal levels. Thus, for
instance, a diagnostic assay in accordance with the invention for diagnosing over- or
underexpression of LSNA or LSP compared to normal control bodily fluids, cells, or
tissue samples may be used to diagnose the presence of lung cancer. The expression
level of an LSP may be determined by any method known in the art, such as those
described supra. In a preferred embodiment, the LSP expression level may be
determined by radioimmunoassays, competitive-binding assays, ELISA, Western blot,
FACS, immunohistochemistry, immunoprecipitation, or proteomic approaches, including
two-dimensional gel electrophoresis (2D electrophoresis) and non-gel-based approaches
such as mass spectrometry or protein interaction profiling. See, e.g., Harlow (1999),
supra; Ausubel (1992), supra; and Ausubel (1999), supra. Alterations in the LSP
structure may be determined by any method known in the art, including, e.g., using
antibodies that specifically recognize phosphoserine, phosphothreonine or
phosphotyrosine residues, two-dimensional polyacrylamide gel electrophoresis (2D
PAGE) and/or chemical analysis of amino acid residues of the protein. Id.

In a preferred embodiment, a radioimmunoassay (RIA) or an ELISA is used. An
antibody specific to an LSP is prepared if one is not already available. In a preferred
embodiment, the antibody is a monoclonal antibody. The anti-LSP antibody is bound to
a solid support and any free protein binding sites on the solid support are blocked with a
protein such as bovine serum albumin. A sample of interest is incubated with the
antibody on the solid support under conditions in which the LSP will bind to the anti-LSP
antibody. The sample is removed, the solid support is washed to remove unbound
material, and an anti-LSP antibody that is linked to a detectable reagent (a radioactive
substance for RIA and an enzyme for ELISA) is added to the solid support and incubated
under conditions in which binding of the LSP to the labeled antibody will occur. After
binding, the unbound labeled antibody is removed by washing. For an ELISA, one or
more substrates are added to produce a colored reaction product that is based upon the
amount of an LSP in the sample. For an RIA, the solid support is counted for radioactive
decay signals by any method known in the art. Quantitative results for both RIA and
ELISA typically are obtained by reference to a standard curve.
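
By way of example only, reading a sample concentration from a standard curve can be
sketched as linear interpolation between bracketing standards. Curve fitting in practice
often uses a four-parameter logistic or similar model; piecewise-linear interpolation is
an assumption made here for simplicity. The function name, the units, and the standard
values shown are hypothetical and are not measured data.

```python
def interpolate_concentration(signal, standards):
    """Estimate analyte concentration from a signal using a standard curve.

    `standards` is a list of (concentration, signal) pairs in which signal
    rises strictly with concentration, as in a sandwich ELISA or RIA.
    """
    points = sorted(standards)
    if len(points) < 2:
        raise ValueError("at least two standards are required")
    for (_, s0), (_, s1) in zip(points, points[1:]):
        if s1 <= s0:
            raise ValueError("signal must increase strictly with concentration")
    if not points[0][1] <= signal <= points[-1][1]:
        raise ValueError("signal lies outside the range of the standard curve")
    for (c0, s0), (c1, s1) in zip(points, points[1:]):
        if s0 <= signal <= s1:
            # Linear interpolation between the two bracketing standards.
            return c0 + (signal - s0) * (c1 - c0) / (s1 - s0)


# Hypothetical standards: (concentration in ng/mL, absorbance).
STANDARDS = [(0.0, 0.05), (10.0, 0.25), (50.0, 1.05)]
estimate = interpolate_concentration(0.65, STANDARDS)
print(round(estimate, 3))
```

Signals outside the range of the standards raise an error rather than being
extrapolated, reflecting the usual practice of diluting and re-assaying such samples.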

Other methods to measure LSP levels are known in the art. For instance, a
competition assay may be employed wherein an anti-LSP antibody is attached to a solid
support and an allocated amount of a labeled LSP and a sample of interest are incubated
with the solid support. The amount of labeled LSP detected which is attached to the solid
support can be correlated to the quantity of an LSP in the sample.

Of the proteomic approaches, 2D PAGE is a well-known technique. Isolation of
individual proteins from a sample such as serum is accomplished using sequential
separation of proteins by isoelectric point and molecular weight. Typically, polypeptides
are first separated by isoelectric point (the first dimension) and then separated by size
using an electric current (the second dimension). In general, the second dimension is
perpendicular to the first dimension. Because no two proteins with different sequences
are identical on the basis of both size and charge, the result of 2D PAGE is a roughly
square gel in which each protein occupies a unique spot. Analysis of the spots with
chemical or antibody probes, or subsequent protein microsequencing, can reveal the
relative abundance of a given protein and the identity of the proteins in the sample.

Expression levels of an LSNA can be determined by any method known in the
art. PCR and other nucleic acid methods, such as ligase chain reaction (LCR)
and nucleic acid sequence based amplification (NASBA), can be used to detect malignant
cells for diagnosis and monitoring of various malignancies. For example,
reverse-transcriptase PCR (RT-PCR) is a powerful technique which can be used to detect
the presence of a specific mRNA population in a complex mixture of thousands of other
mRNA species. In RT-PCR, an mRNA species is first reverse transcribed to
complementary DNA (cDNA) with use of the enzyme reverse transcriptase; the cDNA is
then amplified as in a standard PCR reaction.

Hybridization to specific DNA molecules (e.g., oligonucleotides) arrayed on a
solid support can be used to both detect the expression of and quantitate the level of
expression of one or more LSNAs of interest. In this approach, all or a portion of one or
more LSNAs is fixed to a substrate. A sample of interest, which may comprise RNA,
e.g., total RNA or polyA-selected mRNA, or a complementary DNA (cDNA) copy of the
RNA, is incubated with the solid support under conditions in which hybridization will
occur between the DNA on the solid support and the nucleic acid molecules in the
sample of interest. Hybridization between the substrate-bound DNA and the nucleic acid
molecules in the sample can be detected and quantitated by several means, including,
without limitation, radioactive labeling or fluorescent labeling of the nucleic acid
molecule or a secondary molecule designed to detect the hybrid.

The above tests can be carried out on samples derived from a variety of cells,
bodily fluids and/or tissue extracts such as homogenates or solubilized tissue obtained
from a patient. Tissue extracts are obtained routinely from tissue biopsy and autopsy
material. Bodily fluids useful in the present invention include blood, urine, saliva or any
other bodily secretion or derivative thereof. By blood it is meant to include whole blood,
plasma, serum or any derivative of blood. In a preferred embodiment, the specimen
tested for expression of LSNA or LSP includes, without limitation, lung tissue, fluid
obtained by bronchial alveolar lavage (BAL), sputum, lung cells grown in cell culture,
blood, serum, lymph node tissue and lymphatic fluid. In another preferred embodiment,
especially when metastasis of a primary lung cancer is known or suspected, specimens
include, without limitation, tissues from brain, bone, bone marrow, liver, adrenal glands
and colon. In general, the tissues may be sampled by biopsy, including, without
limitation, needle biopsy, e.g., transthoracic needle aspiration, cervical mediastinoscopy,
endoscopic lymph node biopsy, video-assisted thoracoscopy, exploratory thoracotomy,
bone marrow biopsy and bone marrow aspiration. See Scott, supra and Franklin, pp.
529-570, in Kane, supra. For early and inexpensive detection, assaying for changes in
LSNAs or LSPs in cells in sputum samples may be particularly useful. Methods of
obtaining and analyzing sputum samples are disclosed in Franklin, supra.



All the methods of the present invention may optionally include determining the 
expression levels of one or more other cancer markers in addition to determining the 
expression level of an LSNA or LSP. In many cases, the use of another cancer marker 
will decrease the likelihood of false positives or false negatives. In one embodiment, the 
one or more other cancer markers include other LSNAs or LSPs as disclosed herein. Other
cancer markers useful in the present invention will depend on the cancer being tested and
are known to those of skill in the art. In a preferred embodiment, at least one other
cancer marker in addition to a particular LSNA or LSP is measured. In a more preferred
embodiment, at least two other additional cancer markers are used. In an even more
preferred embodiment, at least three, more preferably at least five, even more preferably
at least ten additional cancer markers are used. 

Diagnosing 

In one aspect, the invention provides a method for determining the expression
levels and/or structural alterations of one or more LSNAs and/or LSPs in a sample from a
patient suspected of having lung cancer. In general, the method comprises the steps of
obtaining the sample from the patient, determining the expression level or structural
alterations of an LSNA and/or LSP and then ascertaining whether the patient has lung
cancer from the expression level of the LSNA or LSP. In general, if high expression
relative to a control of an LSNA or LSP is indicative of lung cancer, a diagnostic assay is
considered positive if the level of expression of the LSNA or LSP is at least two times
higher, more preferably at least five times higher, even more preferably at least
ten times higher, than in, preferably, the same cells, tissues or bodily fluid of a normal
human control. In contrast, if low expression relative to a control of an LSNA or LSP is
indicative of lung cancer, a diagnostic assay is considered positive if the level of
expression of the LSNA or LSP is at least two times lower, more preferably at least
five times lower, even more preferably at least ten times lower, than in, preferably, the
same cells, tissues or bodily fluid of a normal human control. The normal human control
may be from a different patient or from uninvolved tissue of the same patient.
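
The two-fold, five-fold and ten-fold criteria set out above can be expressed, purely as an illustrative sketch, in a few lines of Python. The function name, the argument names and the example values are hypothetical; the sketch assumes that the sample and control levels have been measured in the same units by the same assay.

```python
def fold_change_call(sample, control, cancer_direction="high"):
    """Apply the two-, five- and ten-fold criteria to one marker.

    cancer_direction: "high" if high expression is indicative of disease,
    "low" if low expression is indicative of disease.
    Returns (positive, description of the highest threshold met).
    """
    if sample <= 0 or control <= 0:
        raise ValueError("expression levels must be positive")
    if cancer_direction not in ("high", "low"):
        raise ValueError("cancer_direction must be 'high' or 'low'")
    # Express the change so that larger always means more disease-like.
    fold = sample / control if cancer_direction == "high" else control / sample
    for threshold, label in ((10, "at least ten-fold"),
                             (5, "at least five-fold"),
                             (2, "at least two-fold")):
        if fold >= threshold:
            return True, label
    return False, "below two-fold"


# Hypothetical measurements for a marker that is elevated in lung cancer.
result = fold_change_call(sample=12.0, control=2.0)
```

The same comparison applies where low expression is indicative of lung cancer, by passing cancer_direction="low".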

The present invention also provides a method of determining whether lung cancer
has metastasized in a patient. One may identify whether the lung cancer has metastasized
by measuring the expression levels and/or structural alterations of one or more LSNAs
and/or LSPs in a variety of tissues. The presence of an LSNA or LSP in a certain tissue
at levels higher than that of corresponding noncancerous tissue (e.g., the same tissue
from another individual) is indicative of metastasis if high level expression of an LSNA
or LSP is associated with lung cancer. Similarly, the presence of an LSNA or LSP in a
tissue at levels lower than that of corresponding noncancerous tissue is indicative of
metastasis if low level expression of an LSNA or LSP is associated with lung cancer.
Further, the presence of a structurally altered LSNA or LSP that is associated with lung
cancer is also indicative of metastasis.

In general, if high expression relative to a control of an LSNA or LSP is indicative of
metastasis, an assay for metastasis is considered positive if the level of expression of the
LSNA or LSP is at least two times higher, more preferably at least five times
higher, even more preferably at least ten times higher, than in, preferably, the same cells,
tissues or bodily fluid of a normal human control. In contrast, if low expression relative
to a control of an LSNA or LSP is indicative of metastasis, an assay for metastasis is
considered positive if the level of expression of the LSNA or LSP is at least two times
lower, more preferably at least five times lower, even more preferably at least ten
times lower, than in, preferably, the same cells, tissues or bodily fluid of a normal human
control.

The LSNA or LSP of this invention may be used as an element in an array or a
multi-analyte test to recognize expression patterns associated with lung cancers or other
lung related disorders. In addition, the sequences of either the nucleic acids or proteins
may be used as elements in a computer program for pattern recognition of lung disorders.

Staging 

The invention also provides a method of staging lung cancer in a human patient.
The method comprises identifying a human patient having lung cancer and analyzing
cells, tissues or bodily fluids from such human patient for expression levels and/or
structural alterations of one or more LSNAs or LSPs. First, one or more tumors from a
variety of patients are staged according to procedures well-known in the art, and the
expression level of one or more LSNAs or LSPs is determined for each stage to obtain a
standard expression level for each LSNA and LSP. Then, the LSNA or LSP expression
levels are determined in a biological sample from a patient whose stage of cancer is not
known. The LSNA or LSP expression levels from the patient are then compared to the
standard expression level. By comparing the expression level of the LSNAs and LSPs
from the patient to the standard expression levels, one may determine the stage of the
tumor. The same procedure may be followed using structural alterations of an LSNA or
LSP to determine the stage of a lung cancer.
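
One way the comparison to standard expression levels might be carried out is sketched below, for illustration only. The specification does not prescribe a particular comparison; the sketch assumes that the stage whose standard levels lie closest to the patient's levels, measured as a Euclidean distance on log2-transformed values, is reported. The marker names, stage labels and values are hypothetical.

```python
import math


def closest_stage(patient_levels, stage_standards):
    """Return the stage whose standard expression levels best match the patient.

    patient_levels: {marker: level} for the patient sample.
    stage_standards: {stage: {marker: standard level}} from staged tumors.
    Distance on a log2 scale is an assumption made for this sketch.
    """
    def distance(standard):
        return math.sqrt(sum(
            (math.log2(patient_levels[m]) - math.log2(standard[m])) ** 2
            for m in patient_levels))

    return min(stage_standards, key=lambda stage: distance(stage_standards[stage]))


# Hypothetical standard levels for two markers at three stages.
STANDARDS = {
    "I": {"LSNA_A": 2.0, "LSP_B": 1.0},
    "II": {"LSNA_A": 4.0, "LSP_B": 2.0},
    "III": {"LSNA_A": 8.0, "LSP_B": 4.0},
}
stage = closest_stage({"LSNA_A": 7.0, "LSP_B": 3.5}, STANDARDS)
```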

Monitoring

Further provided is a method of monitoring lung cancer in a human patient. One
may monitor a human patient to determine whether there has been metastasis and, if there
has been, when metastasis began to occur. One may also monitor a human patient to
determine whether a preneoplastic lesion has become cancerous. One may also monitor
a human patient to determine whether a therapy, e.g., chemotherapy, radiotherapy or
surgery, has decreased or eliminated the lung cancer. The method comprises identifying
a human patient that one wants to monitor for lung cancer, periodically analyzing cells,
tissues or bodily fluids from such human patient for expression levels of one or more
LSNAs or LSPs, and comparing the LSNA or LSP levels over time to those LSNA or
LSP expression levels obtained previously. Patients may also be monitored by
measuring one or more structural alterations in an LSNA or LSP that are associated with
lung cancer.
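
The periodic comparison described above can be sketched as follows, for illustration only. The sketch compares the most recent measurement with the one obtained immediately before it; the two-fold cutoff (min_fold), the function name and the wording of the returned labels are assumptions made for the example and are not specified for monitoring in this description.

```python
def monitor_trend(levels, marker_rises_with_disease=True, min_fold=2.0):
    """Compare the latest expression level with the previous one.

    levels: chronological measurements from the same cell type, tissue or
    bodily fluid. min_fold is a hypothetical cutoff for a meaningful change.
    """
    if len(levels) < 2:
        raise ValueError("at least two time points are required")
    previous, latest = levels[-2], levels[-1]
    if previous <= 0 or latest <= 0:
        raise ValueError("expression levels must be positive")
    ratio = latest / previous
    if not marker_rises_with_disease:
        ratio = 1 / ratio  # a fall in the marker is the disease-like direction
    if ratio >= min_fold:
        return "consistent with progression"
    if ratio <= 1 / min_fold:
        return "consistent with response or no progression"
    return "no substantial change"


# Hypothetical quarterly measurements of a marker elevated in lung cancer.
status = monitor_trend([1.0, 1.1, 2.5])
```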

If increased expression of an LSNA or LSP is associated with metastasis,
treatment failure, or conversion of a preneoplastic lesion to a cancerous lesion, then
detecting an increase in the expression level of an LSNA or LSP indicates that the tumor
is metastasizing, that treatment has failed or that the lesion is cancerous, respectively.
One having ordinary skill in the art would recognize that if this were the case, then a
decreased expression level would be indicative of no metastasis, effective therapy or
failure to progress to a neoplastic lesion. If decreased expression of an LSNA or LSP is
associated with metastasis, treatment failure, or conversion of a preneoplastic lesion to a
cancerous lesion, then detecting a decrease in the expression level of an LSNA or LSP
indicates that the tumor is metastasizing, that treatment has failed or that the lesion is
cancerous, respectively. In a preferred embodiment, the levels of LSNAs or LSPs are
determined from the same cell type, tissue or bodily fluid as prior patient samples.

Monitoring a patient for onset of lung cancer metastasis is periodic and preferably is
done on a quarterly basis, but may be done more or less frequently.

The methods described herein can further be utilized as prognostic assays to
identify subjects having or at risk of developing a disease or disorder associated with
increased or decreased expression levels of an LSNA and/or LSP. The present invention
provides a method in which a test sample is obtained from a human patient and one or
more LSNAs and/or LSPs are detected. The presence of higher (or lower) LSNA or LSP
levels as compared to normal human controls is diagnostic for the human patient being at
risk for developing cancer, particularly lung cancer. The effectiveness of therapeutic
agents to decrease (or increase) expression or activity of one or more LSNAs and/or
LSPs of the invention can also be monitored by analyzing levels of expression of the
LSNAs and/or LSPs in a human patient in clinical trials or in in vitro screening assays
such as in human cells. In this way, the gene expression pattern can serve as a marker,
indicative of the physiological response of the human patient or cells, as the case may be,
to the agent being tested.

Detection of Genetic Lesions or Mutations 

The methods of the present invention can also be used to detect genetic lesions or
mutations in an LSG, thereby determining if a human with the genetic lesion is
susceptible to developing lung cancer or to determine what genetic lesions are
responsible, or are partly responsible, for a person's existing lung cancer. Genetic
lesions can be detected, for example, by ascertaining the existence of a deletion, insertion
and/or substitution of one or more nucleotides from the LSGs of this invention, a
chromosomal rearrangement of an LSG, an aberrant modification of an LSG (such as of the
methylation pattern of the genomic DNA), or allelic loss of an LSG. Methods to detect
such lesions in the LSG of this invention are known to those having ordinary skill in the
art following the teachings of the specification.

Methods of Detecting Noncancerous Lung Diseases

The invention also provides a method for determining the expression levels
and/or structural alterations of one or more LSNAs and/or LSPs in a sample from a
patient suspected of having or known to have a noncancerous lung disease. In general,
the method comprises the steps of obtaining a sample from the patient, determining the
expression level or structural alterations of an LSNA and/or LSP, comparing the
expression level or structural alteration of the LSNA or LSP to a normal lung control,
and then ascertaining whether the patient has a noncancerous lung disease. In general, if
high expression relative to a control of an LSNA or LSP is indicative of a particular
noncancerous lung disease, a diagnostic assay is considered positive if the level of
expression of the LSNA or LSP is at least two times higher, more preferably at
least five times higher, even more preferably at least ten times higher, than in, preferably,
the same cells, tissues or bodily fluid of a normal human control. In contrast, if low
expression relative to a control of an LSNA or LSP is indicative of a noncancerous lung
disease, a diagnostic assay is considered positive if the level of expression of the LSNA
or LSP is at least two times lower, more preferably at least five times lower, even
more preferably at least ten times lower, than in, preferably, the same cells, tissues or
bodily fluid of a normal human control. The normal human control may be from a
different patient or from uninvolved tissue of the same patient.

One having ordinary skill in the art may determine whether an LSNA and/or LSP
is associated with a particular noncancerous lung disease by obtaining lung tissue from a
patient having a noncancerous lung disease of interest and determining which LSNAs
and/or LSPs are expressed in the tissue at either a higher or a lower level than in normal
lung tissue. In another embodiment, one may determine whether an LSNA or LSP
exhibits structural alterations in a particular noncancerous lung disease state by obtaining
lung tissue from a patient having a noncancerous lung disease of interest and determining
the structural alterations in one or more LSNAs and/or LSPs relative to normal lung
tissue.

Methods for Identifying Lung Tissue 

In another aspect, the invention provides methods for identifying lung tissue.
These methods are particularly useful in, e.g., forensic science, lung cell differentiation
and development, and in tissue engineering.

In one embodiment, the invention provides a method for determining whether a
sample is lung tissue or has lung tissue-like characteristics. The method comprises the
steps of providing a sample suspected of comprising lung tissue or having lung
tissue-like characteristics, determining whether the sample expresses one or more LSNAs
and/or LSPs, and, if the sample expresses one or more LSNAs and/or LSPs, concluding
that the sample comprises lung tissue. In a preferred embodiment, the LSNA encodes a
polypeptide having an amino acid sequence selected from SEQ ID NO: 165 through 284,
or a homolog, allelic variant or fragment thereof. In a more preferred embodiment, the
LSNA has a nucleotide sequence selected from SEQ ID NO: 1 through 164, or a
hybridizing nucleic acid, an allelic variant or a part thereof. Determining whether a
sample expresses an LSNA can be accomplished by any method known in the art.
Preferred methods include hybridization to microarrays, Northern blot hybridization, and
quantitative or qualitative RT-PCR. In another preferred embodiment, the method can
be practiced by determining whether an LSP is expressed. Determining whether a
sample expresses an LSP can be accomplished by any method known in the art.
Preferred methods include Western blot, ELISA, RIA and 2D PAGE. In one
embodiment, the LSP has an amino acid sequence selected from SEQ ID NO: 165
through 284, or a homolog, allelic variant or fragment thereof. In another preferred
embodiment, the expression of at least two LSNAs and/or LSPs is determined. In a more
preferred embodiment, the expression of at least three, more preferably four and even
more preferably five LSNAs and/or LSPs is determined.
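
The decision rule described above, under which a sample is concluded to comprise lung tissue if it expresses at least a chosen number of LSNAs and/or LSPs, can be sketched as follows for illustration only. The marker identifiers are hypothetical placeholders, and the sketch assumes that a separate assay has already determined which markers are expressed in the sample.

```python
def is_lung_tissue(expressed, lung_markers, min_markers=1):
    """Decide whether a sample expresses enough lung specific markers.

    expressed: identifiers of markers detected in the sample.
    lung_markers: identifiers of the LSNAs and/or LSPs being tested for.
    min_markers: 1 in the basic embodiment; 2 to 5 in preferred embodiments.
    Returns (conclusion, sorted list of lung specific markers detected).
    """
    hits = set(expressed) & set(lung_markers)
    return len(hits) >= min_markers, sorted(hits)


# Hypothetical panel and sample.
PANEL = {"LSNA_1", "LSNA_2", "LSP_3"}
call, detected = is_lung_tissue({"LSNA_1", "OTHER_GENE"}, PANEL)
```

Raising min_markers corresponds to the preferred embodiments in which the expression of two or more LSNAs and/or LSPs is determined.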

In one embodiment, the method can be used to determine whether an unknown
tissue is lung tissue. This is particularly useful in forensic science, in which small,
damaged pieces of tissues that are not identifiable by microscopic or other means are
recovered from a crime or accident scene. In another embodiment, the method can be
used to determine whether a tissue is differentiating or developing into lung tissue. This
is important in monitoring the effects of the addition of various agents to cell or tissue
culture, e.g., in producing new lung tissue by tissue engineering. These agents include,
e.g., growth and differentiation factors, extracellular matrix proteins and culture medium.
Other factors that may be measured for effects on tissue development and differentiation
include gene transfer into the cells or tissues, alterations in pH, aqueous:air interface and
various other culture conditions.

Methods for Producing and Modifying Lung Tissue 

In another aspect, the invention provides methods for producing engineered lung
tissue or cells. In one embodiment, the method comprises the steps of providing cells,
introducing an LSNA or an LSG into the cells, and growing the cells under conditions in
which they exhibit one or more properties of lung tissue cells. In a preferred
embodiment, the cells are pluripotent. As is well-known in the art, normal lung tissue
comprises a large number of different cell types. Thus, in one embodiment, the
engineered lung tissue or cells comprises one of these cell types. In another embodiment,
the engineered lung tissue or cells comprises more than one lung cell type. Further, the
culture conditions of the cells or tissue may require manipulation in order to achieve full
differentiation and development of the lung cell tissue. Methods for manipulating culture
conditions are well-known in the art.

Nucleic acid molecules encoding one or more LSPs are introduced into cells,
preferably pluripotent cells. In a preferred embodiment, the nucleic acid molecules
encode LSPs having amino acid sequences selected from SEQ ID NO: 165 through 284,
or homologous proteins, analogs, allelic variants or fragments thereof. In a more
preferred embodiment, the nucleic acid molecules have a nucleotide sequence selected
from SEQ ID NO: 1 through 164, or hybridizing nucleic acids, allelic variants or parts
thereof. In another highly preferred embodiment, an LSG is introduced into the cells.
Expression vectors and methods of introducing nucleic acid molecules into cells are
well-known in the art and are described in detail, supra.

Artificial lung tissue may be used to treat patients who have lost some or all of 
their lung function. 

Pharmaceutical Compositions 

In another aspect, the invention provides pharmaceutical compositions
comprising the nucleic acid molecules, polypeptides, antibodies, antibody derivatives,
antibody fragments, agonists, antagonists, and inhibitors of the present invention. In a
preferred embodiment, the pharmaceutical composition comprises an LSNA or part
thereof. In a more preferred embodiment, the LSNA has a nucleotide sequence selected
from the group consisting of SEQ ID NO: 1 through 164, a nucleic acid that hybridizes
thereto, an allelic variant thereof, or a nucleic acid that has substantial sequence identity
thereto. In another preferred embodiment, the pharmaceutical composition comprises an
LSP or fragment thereof. In a more preferred embodiment, the LSP has an amino
acid sequence that is selected from the group consisting of SEQ ID NO: 165 through 284,
a polypeptide that is homologous thereto, a fusion protein comprising all or a portion of
the polypeptide, or an analog or derivative thereof. In another preferred embodiment, the
pharmaceutical composition comprises an anti-LSP antibody, preferably an antibody that
specifically binds to an LSP having an amino acid sequence that is selected from the group
consisting of SEQ ID NO: 165 through 284, or an antibody that binds to a polypeptide
that is homologous thereto, a fusion protein comprising all or a portion of the
polypeptide, or an analog or derivative thereof.

Such a composition typically contains from about 0.1 to 90% by weight of a
therapeutic agent of the invention formulated in and/or with a pharmaceutically 
acceptable carrier or excipient. 

Pharmaceutical formulation is a well-established art, and is further described in
Gennaro (ed.), Remington: The Science and Practice of Pharmacy, 20th ed., Lippincott,
Williams & Wilkins (2000); Ansel et al., Pharmaceutical Dosage Forms and Drug
Delivery Systems, 7th ed., Lippincott Williams & Wilkins (1999); and Kibbe (ed.),
Handbook of Pharmaceutical Excipients, American Pharmaceutical Association, 3rd ed.
(2000), the disclosures of which are incorporated herein by reference in their entireties,
and thus need not be described in detail herein.

Briefly, formulation of the pharmaceutical compositions of the present invention 
will depend upon the route chosen for administration. The pharmaceutical compositions 
utilized in this invention can be administered by various routes including both enteral and 
parenteral routes, including oral, intravenous, intramuscular, subcutaneous, inhalation,
topical, sublingual, rectal, intra-arterial, intramedullary, intrathecal, intraventricular,
transmucosal, transdermal, intranasal, intraperitoneal, intrapulmonary, and intrauterine. 

Oral dosage forms can be formulated as tablets, pills, dragees, capsules, liquids, 
gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Solid formulations of the compositions for oral administration can contain
suitable carriers or excipients, such as carbohydrate or protein fillers, such as sugars,
including lactose, sucrose, mannitol, or sorbitol; starch from corn, wheat, rice, potato, or
other plants; cellulose, such as methyl cellulose, hydroxypropylmethyl-cellulose, sodium
carboxymethylcellulose, or microcrystalline cellulose; gums including arabic and
tragacanth; proteins such as gelatin and collagen; inorganics, such as kaolin, calcium
carbonate, dicalcium phosphate, sodium chloride; and other agents such as acacia and
alginic acid.

Agents that facilitate disintegration and/or solubilization can be added, such as
the cross-linked polyvinyl pyrrolidone, agar, alginic acid, or a salt thereof, such as
sodium alginate, microcrystalline cellulose, corn starch, sodium starch glycolate, and
alginic acid.

Tablet binders that can be used include acacia, methylcellulose, sodium
carboxymethylcellulose, polyvinylpyrrolidone (Povidone™), hydroxypropyl
methylcellulose, sucrose, starch and ethylcellulose.

Lubricants that can be used include magnesium stearates, stearic acid, silicone
fluid, talc, waxes, oils, and colloidal silica.

Fillers, agents that facilitate disintegration and/or solubilization, tablet binders
and lubricants, including the aforementioned, can be used singly or in combination.

Solid oral dosage forms need not be uniform throughout. For example, dragee 
cores can be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which can also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic
solvents or solvent mixtures. 

Oral dosage forms of the present invention include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or 
sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or binders, 
such as lactose or starches, lubricants, such as talc or magnesium stearate, and,
optionally, stabilizers. In soft capsules, the active compounds can be dissolved or 
suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with 
or without stabilizers. 

Additionally, dyestuffs or pigments can be added to the tablets or dragee coatings
for product identification or to characterize the quantity of active compound, i.e., dosage.

Liquid formulations of the pharmaceutical compositions for oral (enteral)
administration are prepared in water or other aqueous vehicles and can contain various
suspending agents such as methylcellulose, alginates, tragacanth, pectin, kelgin,
carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl alcohol. The liquid
formulations can also include solutions, emulsions, syrups and elixirs containing,
together with the active compound(s), wetting agents, sweeteners, and coloring and
flavoring agents.



The pharmaceutical compositions of the present invention can also be formulated 
for parenteral administration. Formulations for parenteral administration can be in the 
form of aqueous or non-aqueous isotonic sterile injection solutions or suspensions. 

For intravenous injection, water soluble versions of the compounds of the present 
invention are formulated in, or if provided as a lyophilate, mixed with, a physiologically
acceptable fluid vehicle, such as 5% dextrose ("D5"), physiologically buffered saline, 
0.9% saline, Hanks' solution, or Ringer's solution. Intravenous formulations may include 
carriers, excipients or stabilizers including, without limitation, calcium, human serum 
albumin, citrate, acetate, calcium chloride, carbonate, and other salts. 

Intramuscular preparations, e.g., a sterile formulation of a suitable soluble salt
form of the compounds of the present invention, can be dissolved and administered in a
pharmaceutical excipient such as Water-for-Injection, 0.9% saline, or 5% glucose
solution. Alternatively, a suitable insoluble form of the compound can be prepared and
administered as a suspension in an aqueous base or a pharmaceutically acceptable oil
base, such as an ester of a long chain fatty acid (e.g., ethyl oleate), fatty oils such as
sesame oil, triglycerides, or liposomes.

Parenteral formulations of the compositions can contain various carriers such as 
vegetable oils, dimethylacetamide, dimethylformamide, ethyl lactate, ethyl carbonate, 
isopropyl myristate, ethanol, polyols (glycerol, propylene glycol, liquid polyethylene
glycol, and the like).

Aqueous injection suspensions can also contain substances that increase the 
viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. 
Non-lipid polycationic amino polymers can also be used for delivery. Optionally, the 
suspension can also contain suitable stabilizers or agents that increase the solubility of
the compounds to allow for the preparation of highly concentrated solutions.

Pharmaceutical compositions of the present invention can also be formulated to 
permit injectable, long-term, deposition. Injectable depot forms may be made by forming 
microencapsulated matrices of the compound in biodegradable polymers such as 
polylactide-polyglycolide. Depending upon the ratio of drug to polymer and the nature
of the particular polymer employed, the rate of drug release can be controlled. Examples
of other biodegradable polymers include poly(orthoesters) and poly(anhydrides). Depot
injectable formulations are also prepared by entrapping the drug in microemulsions that
are compatible with body tissues.

The pharmaceutical compositions of the present invention can be administered 
topically. 

For topical use the compounds of the present invention can also be prepared in
suitable forms to be applied to the skin, or mucous membranes of the nose and throat, and
can take the form of lotions, creams, ointments, liquid sprays or inhalants, drops,
tinctures, lozenges, or throat paints. Such topical formulations further can include
chemical compounds such as dimethylsulfoxide (DMSO) to facilitate surface penetration
of the active ingredient. In other transdermal formulations, typically in patch-delivered
formulations, the pharmaceutically active compound is formulated with one or more skin
penetrants, such as 2-N-methyl-pyrrolidone (NMP) or Azone. A topical semi-solid
ointment formulation typically contains a concentration of the active ingredient from
about 1 to 20%, e.g., 5 to 10%, in a carrier such as a pharmaceutical cream base.

For application to the eyes or ears, the compounds of the present invention can be
presented in liquid or semi-liquid form formulated in hydrophobic or hydrophilic bases
as ointments, creams, lotions, paints or powders.

For rectal administration the compounds of the present invention can be 
administered in the form of suppositories admixed with conventional carriers such as
cocoa butter, wax or other glyceride.

Inhalation formulations can also readily be formulated. For inhalation, various 
powder and liquid formulations can be prepared. For aerosol preparations, a sterile 
formulation of the compound or salt form of the compound may be used in inhalers, such 
as metered dose inhalers, and nebulizers. Aerosolized forms may be especially useful for
treating respiratory disorders.

Alternatively, the compounds of the present invention can be in powder form for 
reconstitution in the appropriate pharmaceutically acceptable carrier at the time of 
delivery. 

The pharmaceutically active compound in the pharmaceutical compositions of the
present invention can be provided as the salt of a variety of acids, including but not
limited to hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic acid. Salts
tend to be more soluble in aqueous or other protonic solvents than are the corresponding
free base forms.

After pharmaceutical compositions have been prepared, they are packaged in an
appropriate container and labeled for treatment of an indicated condition.

The active compound will be present in an amount effective to achieve the
intended purpose. The determination of an effective dose is well within the capability of
those skilled in the art.

A "therapeutically effective dose" refers to that amount of active ingredient, for 
example LSP polypeptide, fusion protein, or fragments thereof, antibodies specific for 
LSP, agonists, antagonists or inhibitors of LSP, which ameliorates the signs or symptoms
of the disease or prevents progression thereof; as would be understood in the medical 
arts, cure, although desired, is not required. 

The therapeutically effective dose of the pharmaceutical agents of the present 
invention can be estimated initially by in vitro tests, such as cell culture assays, followed 
by assay in model animals, usually mice, rats, rabbits, dogs, or pigs. The animal model
can also be used to determine an initial preferred concentration range and route of 
administration. 

For example, the ED50 (the dose therapeutically effective in 50% of the
population) and LD50 (the dose lethal to 50% of the population) can be determined in
one or more cell culture or animal model systems. The dose ratio of toxic to therapeutic
effects is the therapeutic index, which can be expressed as LD50/ED50. Pharmaceutical
compositions that exhibit large therapeutic indices are preferred.
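By way of illustration only, the therapeutic index described above reduces to a single ratio. The following is a minimal sketch; the function name and the numeric values are hypothetical and are not taken from any experiment reported herein.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the therapeutic index LD50/ED50.

    Both doses must be expressed in the same units (for example mg/kg).
    """
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, chosen only to show the arithmetic.
print(therapeutic_index(ld50=500.0, ed50=5.0))  # 100.0
```

A larger ratio indicates a wider margin between the effective dose and the lethal dose, which is why compositions with large indices are preferred.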

The data obtained from cell culture assays and animal studies are used in
formulating an initial dosage range for human use, and preferably provide a range of
circulating concentrations that includes the ED50 with little or no toxicity. After
administration, or between successive administrations, the circulating concentration of
active agent varies within this range depending upon pharmacokinetic factors well
known in the art, such as the dosage form employed, sensitivity of the patient, and the
route of administration.

The exact dosage will be determined by the practitioner, in light of factors
specific to the subject requiring treatment. Factors that can be taken into account by the
practitioner include the severity of the disease state, general health of the subject, age,
weight, gender of the subject, diet, time and frequency of administration, drug
combination(s), reaction sensitivities, and tolerance/response to therapy. Long-acting
pharmaceutical compositions can be administered every 3 to 4 days, every week, or once
every two weeks depending on half-life and clearance rate of the particular formulation.

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up to a total
dose of about 1 g, depending upon the route of administration. Where the therapeutic
agent is a protein or antibody of the present invention, the therapeutic protein or antibody
agent typically is administered at a daily dosage of 0.01 mg/kg to 30 mg/kg of body weight
of the patient (e.g., 1 mg/kg to 5 mg/kg). The pharmaceutical formulation can be
administered in multiple doses per day, if desired, to achieve the total desired daily dose.

Guidance as to particular dosages and methods of delivery is provided in the
literature and generally available to practitioners in the art. Those skilled in the art will
employ different formulations for nucleotides than for proteins or their inhibitors.
Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells,
conditions, locations, etc.
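The weight-based daily dosage range given above for protein or antibody agents can be turned into a total daily amount and a per-administration amount with simple arithmetic. The sketch below is illustrative only; the function name, the example patient weight, and the choice of two administrations per day are assumptions, and actual dosing remains a matter for the practitioner.

```python
def daily_dose_mg(weight_kg: float, dose_mg_per_kg: float, doses_per_day: int = 1):
    """Return (total daily dose in mg, amount per administration in mg)."""
    if weight_kg <= 0 or doses_per_day < 1:
        raise ValueError("weight must be positive and doses_per_day at least 1")
    # Range check uses the 0.01 to 30 mg/kg daily range stated in the text.
    if not 0.01 <= dose_mg_per_kg <= 30.0:
        raise ValueError("dose lies outside the 0.01 to 30 mg/kg daily range")
    total = weight_kg * dose_mg_per_kg
    return total, total / doses_per_day


# Hypothetical 70 kg patient at 2 mg/kg per day, split into two administrations.
total, per_dose = daily_dose_mg(70.0, 2.0, doses_per_day=2)
print(total, per_dose)  # 140.0 70.0
```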

Conventional methods, known to those of ordinary skill in the art of medicine, 
can be used to administer the pharmaceutical formulation(s) of the present invention to 
the patient. The pharmaceutical compositions of the present invention can be 
administered alone, or in combination with other therapeutic agents or interventions. 

Therapeutic Methods

The present invention further provides methods of treating subjects having
defects in a gene of the invention, e.g., in expression, activity, distribution, localization,
and/or solubility, which can manifest as a disorder of lung function. As used herein,
"treating" includes all medically-acceptable types of therapeutic intervention, including
palliation and prophylaxis (prevention) of disease. The term "treating" encompasses any
improvement of a disease, including minor improvements. These methods are discussed
below.

Gene Therapy and Vaccines

The isolated nucleic acids of the present invention can also be used to drive in
vivo expression of the polypeptides of the present invention. In vivo expression can be
driven from a vector, typically a viral vector, often a vector based upon a replication
incompetent retrovirus, an adenovirus, or an adeno-associated virus (AAV), for purpose
of gene therapy. In vivo expression can also be driven from signals endogenous to the
nucleic acid or from a vector, often a plasmid vector, such as pVAX1 (Invitrogen,
Carlsbad, CA, USA), for purpose of "naked" nucleic acid vaccination, as further
described in U.S. Patents 5,589,466; 5,679,647; 5,804,566; 5,830,877; 5,843,913;
5,880,104; 5,958,891; 5,985,847; 6,017,897; 6,110,898; and 6,204,250, the disclosures of
which are incorporated herein by reference in their entireties. For cancer therapy, it is
preferred that the vector also be tumor-selective. See, e.g., Doronin et al., J. Virol. 75:
3314-24 (2001).

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising a nucleic
acid of the present invention is administered. The nucleic acid can be delivered in a
vector that drives expression of an LSP, fusion protein, or fragment thereof, or without
such vector. Nucleic acid compositions that can drive expression of an LSP are
administered, for example, to complement a deficiency in the native LSP, or as DNA
vaccines. Expression vectors derived from virus, replication deficient retroviruses,
adenovirus, adeno-associated virus (AAV), herpes virus, or vaccinia virus can be used, as
can plasmids. See, e.g., Cid-Arregui, supra. In a preferred embodiment, the nucleic acid
molecule encodes an LSP having the amino acid sequence of SEQ ID NO: 165 through
284, or a fragment, fusion protein, allelic variant or homolog thereof.

In still other therapeutic methods of the present invention, pharmaceutical
compositions comprising host cells that express an LSP, fusions, or fragments thereof
can be administered. In such cases, the cells are typically autologous, so as to
circumvent xenogeneic or allotypic rejection, and are administered to complement
defects in LSP production or activity. In a preferred embodiment, the nucleic acid
molecules in the cells encode an LSP having the amino acid sequence of SEQ ID NO:
165 through 284, or a fragment, fusion protein, allelic variant or homolog thereof.

Antisense Administration 

Antisense nucleic acid compositions, or vectors that drive expression of an LSG
antisense nucleic acid, are administered to downregulate transcription and/or translation
of an LSG in circumstances in which excessive production, or production of aberrant
protein, is the pathophysiologic basis of disease.
Antisense compositions useful in therapy can have a sequence that is
complementary to coding or to noncoding regions of an LSG. For example,
oligonucleotides derived from the transcription initiation site, e.g., between positions -10
and +10 from the start site, are preferred.
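As a purely illustrative sketch of how an antisense oligonucleotide spanning positions -10 to +10 might be derived, the following takes the sense-strand window around the start site and returns its reverse complement. The function name, the numbering convention (no position 0, so the window is 20 nucleotides), and the example sequence are assumptions made for illustration and do not correspond to any LSG sequence disclosed herein.

```python
# Translation table mapping each DNA base to its Watson-Crick complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense: str, start_index: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Return the antisense oligo (5'->3') covering the start-site window.

    `sense` is the sense-strand DNA written 5'->3'; `start_index` is the
    0-based index of position +1. Positions -upstream..-1 and +1..+downstream
    are taken, assuming there is no position 0.
    """
    sense = sense.upper()
    lo, hi = start_index - upstream, start_index + downstream
    if lo < 0 or hi > len(sense):
        raise ValueError("window extends beyond the supplied sequence")
    window = sense[lo:hi]
    if set(window) - set("ACGT"):
        raise ValueError("sequence contains characters other than A, C, G, T")
    # Complement each base, then reverse so the result reads 5'->3'.
    return window.translate(_COMPLEMENT)[::-1]


# Artificial sequence: ten A's upstream of the start site, ten C's downstream.
example = "GG" + "A" * 10 + "C" * 10 + "TT"
print(antisense_oligo(example, start_index=12))  # GGGGGGGGGGTTTTTTTTTT
```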
Catalytic antisense compositions, such as ribozymes, that are capable of
sequence-specific hybridization to LSG transcripts, are also useful in therapy. See, e.g.,
Phylactou, Adv. Drug Deliv. Rev. 44(2-3): 97-108 (2000); Phylactou et al., Hum. Mol.
Genet. 7(10): 1649-53 (1998); Rossi, Ciba Found. Symp. 209: 195-204 (1997); and
Sigurdsson et al., Trends Biotechnol. 13(8): 286-9 (1995), the disclosures of which are
incorporated herein by reference in their entireties.

Other nucleic acids useful in the therapeutic methods of the present invention are
those that are capable of triplex helix formation in or near the LSG genomic locus. Such
triplexing oligonucleotides are able to inhibit transcription. See, e.g., Intody et al.,
Nucleic Acids Res. 28(21): 4283-90 (2000); McGuffie et al., Cancer Res. 60(14): 3790-9
(2000), the disclosures of which are incorporated herein by reference. Pharmaceutical
compositions comprising such triplex forming oligos (TFOs) are administered in
circumstances in which excessive production, or production of aberrant protein, is a
pathophysiologic basis of disease.

In a preferred embodiment, the antisense molecule is derived from a nucleic acid
molecule encoding an LSP, preferably an LSP comprising an amino acid sequence of
SEQ ID NO: 165 through 284, or a fragment, allelic variant or homolog thereof. In a
more preferred embodiment, the antisense molecule is derived from a nucleic acid
molecule having a nucleotide sequence of SEQ ID NO: 1 through 164, or a part, allelic
variant, substantially similar or hybridizing nucleic acid thereof.

Polypeptide Administration

In one embodiment of the therapeutic methods of the present invention, a 
therapeutically effective amount of a pharmaceutical composition comprising an LSP, a 
fusion protein, fragment, analog or derivative thereof is administered to a subject with a 
clinically-significant LSP defect. 

Protein compositions are administered, for example, to complement a deficiency
in native LSP. In other embodiments, protein compositions are administered as a vaccine
to elicit a humoral and/or cellular immune response to LSP. The immune response can
be used to modulate activity of LSP or, depending on the immunogen, to immunize
against aberrant or aberrantly expressed forms, such as mutant or inappropriately
expressed isoforms. In yet other embodiments, protein fusions having a toxic moiety are
administered to ablate cells that aberrantly accumulate LSP.

In a preferred embodiment, the polypeptide is an LSP comprising an amino acid
sequence of SEQ ID NO: 165 through 284, or a fusion protein, allelic variant, homolog,
analog or derivative thereof. In a more preferred embodiment, the polypeptide is
encoded by a nucleic acid molecule having a nucleotide sequence of SEQ ID NO: 1
through 164, or a part, allelic variant, substantially similar or hybridizing nucleic acid
thereof.

Antibody, Agonist and Antagonist Administration 

In another embodiment of the therapeutic methods of the present invention, a
therapeutically effective amount of a pharmaceutical composition comprising an
antibody (including fragment or derivative thereof) of the present invention is
administered. As is well-known, antibody compositions are administered, for example,
to antagonize activity of LSP, or to target therapeutic agents to sites of LSP presence
and/or accumulation. In a preferred embodiment, the antibody specifically binds to an
LSP comprising an amino acid sequence of SEQ ID NO: 165 through 284, or a fusion
protein, allelic variant, homolog, analog or derivative thereof. In a more preferred
embodiment, the antibody specifically binds to an LSP encoded by a nucleic acid
molecule having a nucleotide sequence of SEQ ID NO: 1 through 164, or a part, allelic
variant, substantially similar or hybridizing nucleic acid thereof.

The present invention also provides methods for identifying modulators which
bind to an LSP or have a modulatory effect on the expression or activity of an LSP.
Modulators which decrease the expression or activity of LSP (antagonists) are believed
to be useful in treating lung cancer. Such screening assays are known to those of skill in
the art and include, without limitation, cell-based assays and cell-free assays. Small
molecules predicted via computer imaging to specifically bind to regions of an LSP can
also be designed, synthesized and tested for use in the imaging and treatment of lung
cancer. Further, libraries of molecules can be screened for potential anticancer agents by
assessing the ability of the molecule to bind to the LSPs identified herein. Molecules
identified in the library as being capable of binding to an LSP are key candidates for
further evaluation for use in the treatment of lung cancer. In a preferred embodiment,
these molecules will downregulate expression and/or activity of an LSP in cells.

In another embodiment of the therapeutic methods of the present invention, a
pharmaceutical composition comprising a non-antibody antagonist of LSP is
administered. Antagonists of LSP can be produced using methods generally known in
the art. In particular, purified LSP can be used to screen libraries of pharmaceutical
agents, often combinatorial libraries of small molecules, to identify those that specifically
bind and antagonize at least one activity of an LSP.

In other embodiments a pharmaceutical composition comprising an agonist of an
LSP is administered. Agonists can be identified using methods analogous to those used
to identify antagonists.

In a preferred embodiment, the antagonist or agonist specifically binds to and
antagonizes or agonizes, respectively, an LSP comprising an amino acid sequence of
SEQ ID NO: 165 through 284, or a fusion protein, allelic variant, homolog, analog or
derivative thereof. In a more preferred embodiment, the antagonist or agonist
specifically binds to and antagonizes or agonizes, respectively, an LSP encoded by a
nucleic acid molecule having a nucleotide sequence of SEQ ID NO: 1 through 164, or a
part, allelic variant, substantially similar or hybridizing nucleic acid thereof.
Targeting Lung Tissue 

The invention also provides a method in which a polypeptide of the invention, or
an antibody thereto, is linked to a therapeutic agent such that it can be delivered to the
lung or to specific cells in the lung. In a preferred embodiment, an anti-LSP antibody is
linked to a therapeutic agent and is administered to a patient in need of such therapeutic
agent. The therapeutic agent may be a toxin, if lung tissue needs to be selectively
destroyed. This would be useful for targeting and killing lung cancer cells. In another
embodiment, the therapeutic agent may be a growth or differentiation factor, which
would be useful for promoting lung cell function.

In another embodiment, an anti-LSP antibody may be linked to an imaging agent
that can be detected using, e.g., magnetic resonance imaging, CT or PET. This would be
useful for determining and monitoring lung function, identifying lung cancer tumors, and
identifying noncancerous lung diseases.
EXAMPLES



Example 1: Gene Expression analysis 

LSGs were identified by mRNA subtraction analysis using standard methods.
The sequences were extended using GenBank sequences and Incyte's proprietary database.
From the nucleotide sequences, predicted amino acid sequences were prepared.

DEX0291_1, DEX0291_2 correspond to SEQ ID NO: 1, 2, etc. DEX0134 was the parent
sequence found in the mRNA subtractions.

DEX0291_1     DEX0134_1         DEX0291_165
DEX0291_2     flex DEX0134_1
DEX0291_3     DEX0134_2         DEX0291_166
DEX0291_4     flex DEX0134_2
DEX0291_5     DEX0134_3         DEX0291_167
DEX0291_6     flex DEX0134_3    DEX0291_168
DEX0291_7     DEX0134_4         DEX0291_169
DEX0291_8     flex DEX0134_4    DEX0291_170
DEX0291_9     DEX0134_5         DEX0291_171
DEX0291_10    flex DEX0134_5
DEX0291_11    DEX0134_6         DEX0291_172
DEX0291_12    flex DEX0134_6
DEX0291_13    DEX0134_7         DEX0291_173
DEX0291_14    flex DEX0134_7
DEX0291_15    DEX0134_8         DEX0291_174
DEX0291_16    flex DEX0134_8    DEX0291_175
DEX0291_17    DEX0134_9
DEX0291_18    DEX0134_10        DEX0291_176
DEX0291_19    DEX0134_11        DEX0291_177
DEX0291_20    flex DEX0134_11
DEX0291_21    DEX0134_12        DEX0291_178
DEX0291_22    flex DEX0134_12   DEX0291_179
DEX0291_23    DEX0134_13        DEX0291_180
DEX0291_24    DEX0134_14        DEX0291_181
DEX0291_25    DEX0134_15        DEX0291_182
DEX0291_26    flex DEX0134_15   DEX0291_183
DEX0291_27    DEX0134_16        DEX0291_184
DEX0291_28    flex DEX0134_16   DEX0291_185
DEX0291_29    DEX0134_17        DEX0291_186
DEX0291_30    flex DEX0134_17   DEX0291_187
DEX0291_31    DEX0134_18        DEX0291_188
DEX0291_32    DEX0134_19        DEX0291_189
DEX0291_33    DEX0134_20        DEX0291_190
DEX0291_34    flex DEX0134_20   DEX0291_191
DEX0291_35    DEX0134_21        DEX0291_192
DEX0291_36    flex DEX0134_21
DEX0291_37    DEX0134_22        DEX0291_193
DEX0291_38    flex DEX0134_22   DEX0291_194
DEX0291_39    DEX0134_23
DEX0291_40    DEX0134_24        DEX0291_195
DEX0291_41    DEX0134_25
DEX0291_42    DEX0134_27        DEX0291_196
DEX0291_43    flex DEX0134_27
DEX0291_44    DEX0134_28
DEX0291_45    DEX0134_29        DEX0291_197
DEX0291_46    flex DEX0134_29   DEX0291_198
DEX0291_47    DEX0134_30        DEX0291_199
DEX0291_48    flex DEX0134_30   DEX0291_200
DEX0291_49    DEX0134_31        DEX0291_201
DEX0291_50    flex DEX0134_31
DEX0291_51    DEX0134_32        DEX0291_202
DEX0291_52    DEX0134_33
DEX0291_53    DEX0134_34        DEX0291_203
DEX0291_54    flex DEX0134_34   DEX0291_204
DEX0291_55    DEX0134_35        DEX0291_205
DEX0291_56    flex DEX0134_35   DEX0291_206
DEX0291_57    DEX0134_36        DEX0291_207
DEX0291_58    flex DEX0134_36   DEX0291_208
DEX0291_59    DEX0134_37        DEX0291_209
DEX0291_60    flex DEX0134_37   DEX0291_210
DEX0291_61    DEX0134_38        DEX0291_211
DEX0291_62    flex DEX0134_38   DEX0291_212
DEX0291_63    DEX0134_39        DEX0291_213
DEX0291_64    flex DEX0134_39
DEX0291_65    DEX0134_40        DEX0291_214
DEX0291_66    flex DEX0134_40
DEX0291_67    DEX0134_41        DEX0291_215
DEX0291_68    flex DEX0134_41
DEX0291_69    DEX0134_42        DEX0291_216
DEX0291_70    flex DEX0134_42
DEX0291_71    DEX0134_43
DEX0291_72    DEX0134_44        DEX0291_217
DEX0291_73    flex DEX0134_44   DEX0291_218
DEX0291_74    DEX0134_46
DEX0291_75    flex DEX0134_46
DEX0291_76    DEX0134_47        DEX0291_219
DEX0291_77    flex DEX0134_47   DEX0291_220
DEX0291_78    DEX0134_48        DEX0291_221
DEX0291_79    flex DEX0134_48   DEX0291_222
DEX0291_80    DEX0134_49        DEX0291_223
DEX0291_81    flex DEX0134_49   DEX0291_224
DEX0291_82    DEX0134_51        DEX0291_225
DEX0291_83    flex DEX0134_51
DEX0291_84    DEX0134_52        DEX0291_226
DEX0291_85    flex DEX0134_52
DEX0291_86    DEX0134_53        DEX0291_227
DEX0291_87    flex DEX0134_53   DEX0291_228
DEX0291_88    DEX0134_54        DEX0291_229
DEX0291_89    flex DEX0134_54   DEX0291_230
DEX0291_90    DEX0134_55        DEX0291_231
DEX0291_91    flex DEX0134_55
DEX0291_92    DEX0134_56        DEX0291_232
DEX0291_93    flex DEX0134_56   DEX0291_233
DEX0291_94    DEX0134_57        DEX0291_234
DEX0291_95    DEX0134_58        DEX0291_235
DEX0291_96    flex DEX0134_58
DEX0291_97    DEX0134_60        DEX0291_236
DEX0291_98    flex DEX0134_60
DEX0291_99    DEX0134_61        DEX0291_237
DEX0291_100   flex DEX0134_61
DEX0291_101   DEX0134_62        DEX0291_238
DEX0291_102   flex DEX0134_62   DEX0291_239
DEX0291_103   DEX0134_63        DEX0291_240
DEX0291_104   DEX0134_64        DEX0291_241
DEX0291_105   flex DEX0134_64
DEX0291_106   DEX0134_65        DEX0291_242
DEX0291_107   flex DEX0134_65   DEX0291_243
DEX0291_108   DEX0134_66        DEX0291_244
DEX0291_109   flex DEX0134_66
DEX0291_110   DEX0134_67
DEX0291_111   flex DEX0134_67   DEX0291_245
DEX0291_112   DEX0134_68        DEX0291_246
DEX0291_113   flex DEX0134_68   DEX0291_247
DEX0291_114   DEX0134_69        DEX0291_248
DEX0291_115   flex DEX0134_69   DEX0291_249
DEX0291_116   DEX0134_70        DEX0291_250
DEX0291_117   flex DEX0134_70   DEX0291_251
DEX0291_118   DEX0134_71        DEX0291_252
DEX0291_119   DEX0134_72        DEX0291_253
DEX0291_120   flex DEX0134_72
DEX0291_121   DEX0134_73        DEX0291_254
DEX0291_122   flex DEX0134_73   DEX0291_255
DEX0291_123   DEX0134_74        DEX0291_256
DEX0291_124   flex DEX0134_74
DEX0291_125   DEX0134_75        DEX0291_257
DEX0291_126   flex DEX0134_75
DEX0291_127   DEX0134_76        DEX0291_258
DEX0291_128   flex DEX0134_76   DEX0291_259
DEX0291_129   DEX0134_77        DEX0291_260
DEX0291_130   flex DEX0134_77
DEX0291_131   DEX0134_78        DEX0291_261
DEX0291_132   flex DEX0134_78
DEX0291_133   DEX0134_79        DEX0291_262
DEX0291_134   DEX0134_80        DEX0291_263
DEX0291_135   flex DEX0134_80   DEX0291_264
DEX0291_136   DEX0134_81        DEX0291_265
DEX0291_137   DEX0134_82        DEX0291_266
DEX0291_138   DEX0134_83        DEX0291_267
DEX0291_139   flex DEX0134_83
DEX0291_140   DEX0134_84        DEX0291_268
DEX0291_141   flex DEX0134_84
DEX0291_142   DEX0134_85        DEX0291_269
DEX0291_143   DEX0134_86        DEX0291_270
DEX0291_144   flex DEX0134_86
DEX0291_145   DEX0134_87
DEX0291_146   DEX0134_88        DEX0291_271
DEX0291_147   DEX0134_89        DEX0291_272
DEX0291_148   flex DEX0134_89
DEX0291_149   DEX0134_90        DEX0291_273
DEX0291_150   flex DEX0134_90   DEX0291_274
DEX0291_151   DEX0134_91        DEX0291_275
DEX0291_152   flex DEX0134_91
DEX0291_153   DEX0134_92        DEX0291_276
DEX0291_154   flex DEX0134_92
DEX0291_155   DEX0134_93        DEX0291_277
DEX0291_156   flex DEX0134_93
DEX0291_157   DEX0134_94
DEX0291_158   DEX0134_95        DEX0291_278
DEX0291_159   DEX0134_96        DEX0291_279
DEX0291_160   DEX0134_97        DEX0291_280
DEX0291_161   flex DEX0134_97   DEX0291_281
DEX0291_162   DEX0134_98        DEX0291_282
DEX0291_163   DEX0134_99        DEX0291_283
DEX0291_164   flex DEX0134_99   DEX0291_284

The chromosomal locations were as follows: 



DEX0291_6     chromosome 14
DEX0291_12    chromosome 2
DEX0291_16    chromosome 19
DEX0291_19    chromosome 17
DEX0291_21    chromosome 1
DEX0291_23    chromosome 12
DEX0291_24    chromosome 16
DEX0291_26    chromosome 10
DEX0291_28    chromosome 4
DEX0291_31    chromosome 11
DEX0291_33    chromosome 11
DEX0291_34    chromosome 11
DEX0291_36    chromosome 5
DEX0291_38    chromosome 11
DEX0291_39    chromosome 16
DEX0291_40    chromosome 13
DEX0291_41    chromosome 13
DEX0291_42    chromosome 10
DEX0291_43    chromosome 10
DEX0291_44    chromosome 7
DEX0291_46    chromosome 12
DEX0291_48    chromosome 10
DEX0291_52    chromosome 7
DEX0291_53    chromosome 7
DEX0291_56    chromosome X
DEX0291_58    chromosome 15
DEX0291_59    chromosome 7
DEX0291_60    chromosome 7
DEX0291_61    chromosome 5
DEX0291_62    chromosome 5
DEX0291_68    chromosome 5
DEX0291_70    chromosome 8
DEX0291_73    chromosome 2
DEX0291_77    chromosome 8
DEX0291_78    chromosome 16
DEX0291_79    chromosome 16
DEX0291_85    chromosome 15
DEX0291_89    chromosome 20
DEX0291_91    chromosome 2
DEX0291_93    chromosome 1
DEX0291_94    chromosome 15
DEX0291_98    chromosome 15
DEX0291_99    chromosome 2
DEX0291_101   chromosome 8
DEX0291_102   chromosome 8
DEX0291_107   chromosome 4
DEX0291_111   chromosome 7
DEX0291_118   chromosome 7
DEX0291_123   chromosome 4
DEX0291_124   chromosome 4
DEX0291_127   chromosome 6
DEX0291_128   chromosome 16
DEX0291_131   chromosome 3
DEX0291_132   chromosome 3
DEX0291_135   chromosome 1
DEX0291_138   chromosome 18
DEX0291_139   chromosome 18
DEX0291_141   chromosome 3
DEX0291_142   chromosome 1
DEX0291_146   chromosome 5
DEX0291_150   chromosome 1
DEX0291_151   chromosome 7
DEX0291_152   chromosome 7
DEX0291_153   chromosome 5
DEX0291_156   chromosome 12
DEX0291_160   chromosome 8
DEX0291_161   chromosome 8
DEX0291_162   chromosome 17
DEX0291_163   chromosome 6
DEX0291_164   chromosome 6




LSGs were also identified by a systematic analysis of gene expression data in the
LIFESEQ® Gold database available from Incyte Genomics Inc (Palo Alto, CA) using
the data mining software package CLASP™ (Candidate Lead Automatic Search
Program). CLASP™ is a set of algorithms that interrogate Incyte's database to identify
genes that are both specific to particular tissue types and differentially expressed in
tissues from patients with cancer. LifeSeq® Gold contains information about which
genes are expressed in various tissues in the body and about the dynamics of expression
in both normal and diseased states. CLASP™ first sorts the LifeSeq® Gold database into
defined tissue types, such as breast, ovary and prostate. CLASP™ categorizes each
tissue sample by disease state. Disease states include "healthy," "cancer," "associated
with cancer," "other disease" and "other." Categorizing the disease states improves our
ability to identify tissue and cancer-specific molecular targets. CLASP™ then performs a
simultaneous parallel search for genes that are expressed both (1) selectively in the
defined tissue type compared to other tissue types and (2) differentially in the "cancer"
disease state compared to the other disease states affecting the same, or different, tissues.
This sorting is accomplished by using mathematical and statistical filters that specify the
minimum change in expression levels and the minimum frequency that the differential
expression pattern must be observed across the tissue samples for the gene to be
considered statistically significant. The CLASP™ algorithm quantifies the relative
abundance of a particular gene in each tissue type and in each disease state.

To find the LSGs of this invention, the following specific CLASP™ profiles were
utilized: tissue-specific expression (CLASP 1), detectable expression only in cancer
tissue (CLASP 2), highest differential expression for a given cancer (CLASP 4), and
differential expression in cancer tissue (CLASP 5). cDNA libraries were divided
into 60 unique tissue types (early versions of LifeSeq® had 48 tissue types). Genes or
ESTs were grouped into "gene bins," where each bin is a cluster of sequences that
share a common contig. The expression level for each gene bin was
calculated for each tissue type. Differential expression significance was calculated with
rigorous statistical significance testing taking into account variations in sample size and
relative gene abundance in different libraries and within each library (for the equations
used to determine statistically significant expression see Audic and Claverie "The
significance of digital gene expression profiles," Genome Res 7(10): 986-995 (1997),
including Equation 1 on page 987 and Equation 2 on page 988, the contents of which are
incorporated by reference). Differentially expressed tissue-specific genes were selected
based on the percentage abundance level in the targeted tissue versus all the other tissues
(tissue-specificity). The expression levels for each gene in libraries of normal tissues or
non-tumor tissues from cancer patients were compared with the expression levels in
tissue libraries associated with tumor or disease (cancer-specificity). The results were
analyzed for statistical significance.
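For orientation, the kind of digital expression test cited above can be sketched as follows. The sketch uses the form of the Audic and Claverie probability commonly given for two libraries of unequal size, p(y|x) = (N2/N1)^y (x+y)! / [x! y! (1 + N2/N1)^(x+y+1)], where x and y are the tag counts of a gene in libraries of total size N1 and N2. The function names and the example counts are hypothetical, and no claim is made that this reproduces the exact CLASP™ computation.

```python
import math


def ac_probability(x: int, y: int, n1: int, n2: int) -> float:
    """Probability of observing y tags in library 2 given x tags in library 1.

    n1 and n2 are the total numbers of tags sequenced in each library.
    Computed in log space so that large counts do not overflow.
    """
    r = n2 / n1
    log_p = (y * math.log(r)
             + math.lgamma(x + y + 1)
             - math.lgamma(x + 1)
             - math.lgamma(y + 1)
             - (x + y + 1) * math.log1p(r))
    return math.exp(log_p)


def ac_upper_tail(x: int, y: int, n1: int, n2: int) -> float:
    """Probability of observing y or more tags in library 2 given x in library 1."""
    below = sum(ac_probability(x, k, n1, n2) for k in range(y))
    return max(0.0, 1.0 - below)


# Hypothetical counts: 2 tags in a normal library and 12 in a tumor library,
# each library containing 10,000 sequenced tags.
p = ac_upper_tail(2, 12, 10_000, 10_000)
print(f"upper-tail probability: {p:.3g}")
```

A small upper-tail probability suggests that the higher count in the second library is unlikely to arise from sampling alone; the threshold applied to that probability is a separate choice.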

Target genes meeting the rigorous CLASP™ profile criteria were selected
as follows:

(a) CLASP 1: tissue-specific expression: To qualify as a CLASP 1 candidate, a
gene must exhibit statistically significant expression in the tissue of interest
compared to all other tissues. Only if the gene exhibits such differential
expression with a 90% confidence level is it selected as a CLASP 1
candidate.

(b) CLASP 2: detectable expression only in cancer tissue: To qualify as a CLASP
2 candidate, a gene must exhibit detectable expression in tumor tissues and
undetectable expression in libraries from normal individuals and libraries
from normal tissue obtained from diseased patients. In addition, such a gene
must also exhibit further specificity for the tumor tissues of interest.

(c) CLASP 4: highest differential expression for a given cancer: To qualify as a
CLASP 4 candidate, a gene must be differentially expressed in tumor libraries
in the tissue of interest compared to normal libraries for all tissues. In
addition, it must be one of the 50 genes with the highest differential
expression.

(d) CLASP 5: differential expression in cancer tissue: To qualify as a CLASP 5
candidate, a gene must be differentially expressed in tumor libraries in the
tissue of interest compared to normal libraries for all tissues. Only if the gene
exhibits such differential expression with a 90% confidence level is it
selected as a CLASP 5 candidate.
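The four profile criteria above can be read as independent filters applied to each gene bin. The following sketch is a simplified illustration under stated assumptions: the `GeneBin` record, its field names, and the use of a simple tumor-minus-normal abundance difference for ranking are all hypothetical, and the additional tumor-tissue specificity requirement of CLASP 2 is not modeled.

```python
from dataclasses import dataclass


@dataclass
class GeneBin:
    """Hypothetical per-gene summary; field names are illustrative only."""
    name: str
    tissue_confidence: float   # confidence that expression is tissue-specific
    cancer_confidence: float   # confidence of tumor-versus-normal difference
    tumor_abundance: float     # relative abundance in tumor libraries
    normal_abundance: float    # relative abundance in normal libraries


def clasp_profiles(bins, top_n=50, confidence=0.90):
    """Map each gene bin name to the list of CLASP profiles it satisfies."""
    # CLASP 4: rank by differential expression and keep the top_n up-regulated bins.
    ranked = sorted(bins,
                    key=lambda b: b.tumor_abundance - b.normal_abundance,
                    reverse=True)
    top = {b.name for b in ranked[:top_n]
           if b.tumor_abundance > b.normal_abundance}

    result = {}
    for b in bins:
        profiles = []
        if b.tissue_confidence >= confidence:                 # CLASP 1
            profiles.append(1)
        if b.tumor_abundance > 0 and b.normal_abundance == 0:  # CLASP 2 (simplified)
            profiles.append(2)
        if b.name in top:                                     # CLASP 4
            profiles.append(4)
        if b.cancer_confidence >= confidence:                 # CLASP 5
            profiles.append(5)
        result[b.name] = profiles
    return result


# Made-up bins, with top_n reduced to 1 so the ranking step is visible.
bins = [
    GeneBin("bin_a", 0.95, 0.50, 0.01, 0.00),
    GeneBin("bin_b", 0.50, 0.92, 0.05, 0.01),
    GeneBin("bin_c", 0.10, 0.10, 0.00, 0.02),
]
print(clasp_profiles(bins, top_n=1))
```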

DEX0291_86 Lung 5H
DEX0291_87 Lung 5H

The individual tissue expression levels from the Incyte LifeSeq database were as follows: 



DEX0291_1     SEQ ID NO: 1      UTR .0044
DEX0291_10    SEQ ID NO: 10     OVR .0021
DEX0291_100   SEQ ID NO: 100    LMN .0028
DEX0291_101   SEQ ID NO: 101    INS .0067
DEX0291_104   SEQ ID NO: 104    UTR .0044
DEX0291_105   SEQ ID NO: 105    UTR .0044
DEX0291_108   SEQ ID NO: 108    LMN .0028
DEX0291_11    SEQ ID NO: 11     FTS .0006
DEX0291_110   SEQ ID NO: 110    LIV .0038
DEX0291_112   SEQ ID NO: 112    BLD .0225
DEX0291_113   SEQ ID NO: 113    BLD .0225
DEX0291_114   SEQ ID NO: 114    PNS .0047
DEX0291_115   SEQ ID NO: 115    PNS .0047
DEX0291_116   SEQ ID NO: 116    LNG .0006
DEX0291_117   SEQ ID NO: 117    LNG .0006
DEX0291_118   SEQ ID NO: 118    LNG .0006
DEX0291_119   SEQ ID NO: 119    PAN .0047
DEX0291_12    SEQ ID NO: 12     FTS .0006
DEX0291_120   SEQ ID NO: 120    PAN .0047
DEX0291_123   SEQ ID NO: 123    UNC .004
DEX0291_124   SEQ ID NO: 124    UNC .004
DEX0291_127   SEQ ID NO: 127    INS .001
DEX0291_128   SEQ ID NO: 128    PNS .007
DEX0291_13    SEQ ID NO: 13     OVR .0154
DEX0291_131   SEQ ID NO: 131    UTR .0006
DEX0291_132   SEQ ID NO: 132    UTR .0006
DEX0291_137   SEQ ID NO: 137    LMN .0056
DEX0291_138   SEQ ID NO: 138    INS .001
DEX0291_139   SEQ ID NO: 139    INS .001



PAN .0059 


BMR .0064 


TON .0299 


PRO .0034 


UNC .004 


THR.0045 


NOS .0073 






GLB .0139 


UTR.0182 


NOS .022 


PAN .0059 


BMR .0064 


TON .0299 


PAN .0059 


BMR .0064 


TON .0299 


UNC .004 






SPL .0021 


LNG .0034 


INS .0038 


THY .004 


CRD .0068 





CRD .0068 
CRD .0068 
OVR .001 
OVR .001 
LMN .0028 
NOS .0073 
SPL .0021 
NOS .0073 
NOS .0073 
NOS .0073 
BID .0016 

LNG .0296 
LMN .0028 
LMN .0028 
PNS .0094 
MAM .0028 
MAM .0028 



BON .0169 
BON .0169 
PRO .0017 
PRO .0017 

GLB .0139 
LNG .0034 
GLB .0139 
LIV .0076 
LIV .0076 
LNG .0017 



FAL .0063 
FAL .0063 
MAM .0194 
UNC .004 
UNC .004 



BLD .0048 
BLD .0048 



INS .0038 

ADR .0089 
ADR .0089 
MAM .0019 



FAL .0251 
FAL .0126 
FAL .0126 
DEX0291_14    SEQ ID NO: 14
DEX0291_142   SEQ ID NO: 142
DEX0291_146   SEQ ID NO: 146
DEX0291_147   SEQ ID NO: 147
DEX0291_148   SEQ ID NO: 148
DEX0291_15    SEQ ID NO: 15
DEX0291_151   SEQ ID NO: 151
DEX0291_152   SEQ ID NO: 152
DEX0291_155   SEQ ID NO: 155
DEX0291_158   SEQ ID NO: 158
DEX0291_160   SEQ ID NO: 160
DEX0291_161   SEQ ID NO: 161
DEX0291_18    SEQ ID NO: 18
DEX0291_19    SEQ ID NO: 19
DEX0291_2     SEQ ID NO: 2
DEX0291_20    SEQ ID NO: 20
DEX0291_21    SEQ ID NO: 21
DEX0291_22    SEQ ID NO: 22
DEX0291_23    SEQ ID NO: 23
DEX0291_25    SEQ ID NO: 25
DEX0291_26    SEQ ID NO: 26
DEX0291_27    SEQ ID NO: 27
DEX0291_29    SEQ ID NO: 29
DEX0291_33    SEQ ID NO: 33
DEX0291_34    SEQ ID NO: 34
DEX0291_37    SEQ ID NO: 37
DEX0291_38    SEQ ID NO: 38
DEX0291_45    SEQ ID NO: 45
DEX0291_47    SEQ ID NO: 47
DEX0291_48    SEQ ID NO: 48
DEX0291_49    SEQ ID NO: 49
DEX0291_5     SEQ ID NO: 5
DEX0291_51    SEQ ID NO: 51
DEX0291_53    SEQ ID NO: 53
DEX0291_54    SEQ ID NO: 54
DEX0291_55    SEQ ID NO: 55
DEX0291_56    SEQ ID NO: 56
DEX0291_59    SEQ ID NO: 59
DEX0291_65    SEQ ID NO: 65
DEX0291_66    SEQ ID NO: 66
DEX0291_67    SEQ ID NO: 67
DEX0291_68    SEQ ID NO: 68
DEX0291_7     SEQ ID NO: 7
DEX0291_70    SEQ ID NO: 70
DEX0291_71    SEQ ID NO: 71
DEX0291_72    SEQ ID NO: 72
DEX0291_74    SEQ ID NO: 74
DEX0291_75    SEQ ID NO: 75
DEX0291_78    SEQ ID NO: 78
DEX0291_79    SEQ ID NO: 79
DEX0291_8     SEQ ID NO: 8
DEX0291_80    SEQ ID NO: 80
DEX0291_86    SEQ ID NO: 86
DEX0291_87    SEQ ID NO: 87
DEX0291_88    SEQ ID NO: 88
DEX0291_89    SEQ ID NO: 89
DEX0291_9     SEQ ID NO: 9
DEX0291_90    SEQ ID NO: 90
DEX0291_91    SEQ ID NO: 91     LMN .0028    NOS .0073
DEX0291_92    SEQ ID NO: 92     INL .0045
DEX0291_93    SEQ ID NO: 93     INL .0045
DEX0291_95    SEQ ID NO: 95     CRD .0023    BLD .0064
DEX0291_97    SEQ ID NO: 97     ADR .0104    KID .0141    PLE .015
DEX0291_98    SEQ ID NO: 98     ADR .0104    KID .0141    PLE .015
DEX0291_99    SEQ ID NO: 99     LMN .0028    NOS .0073

Abbreviations for tissues:

BLO Blood; BRN Brain; CON Connective Tissue; CRD Heart; FTS Fetus; INL Intestine, Large;
INS Intestine, Small; KID Kidney; LIV Liver; LNG Lung; MAM Breast; MSL Muscles; NRV
Nervous Tissue; OVR Ovary; PRO Prostate; STO Stomach; THR Thyroid Gland; TNS Tonsil,
Adenoids; UTR Uterus

Example 2: Relative Quantitation of Gene Expression 

Real-Time quantitative PCR with fluorescent Taqman probes is a quantitation
detection system utilizing the 5'-3' nuclease activity of Taq DNA polymerase. The
method uses an internal fluorescent oligonucleotide probe (Taqman) labeled with a 5'
reporter dye and a downstream, 3' quencher dye. During PCR, the 5'-3' nuclease activity
of Taq DNA polymerase releases the reporter, whose fluorescence can then be detected
by the laser detector of the Model 7700 Sequence Detection System (PE Applied
Biosystems, Foster City, CA, USA). Amplification of an endogenous control is used to
standardize the amount of sample RNA added to the reaction and normalize for Reverse
Transcriptase (RT) efficiency. Either cyclophilin, glyceraldehyde-3-phosphate
dehydrogenase (GAPDH), ATPase, or 18S ribosomal RNA (rRNA) is used as this
endogenous control. To calculate relative quantitation between all the samples studied,
the target RNA levels for one sample are used as the basis for comparative results
(calibrator). Quantitation relative to the "calibrator" can be obtained using the standard
curve method or the comparative method (User Bulletin #2: ABI PRISM 7700 Sequence
Detection System).
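
The comparative method mentioned above is commonly expressed as the 2^(-ddCt) calculation. The following is a minimal sketch of that arithmetic, assuming roughly equal and near-100% amplification efficiency for the target and the endogenous control; the function name, argument names, and the threshold-cycle (Ct) values in the usage note are hypothetical and do not come from this specification.

```python
def relative_expression(ct_target_sample, ct_control_sample,
                        ct_target_calibrator, ct_control_calibrator):
    """Fold expression of a target in a sample relative to the calibrator.

    Each argument is a threshold cycle (Ct). The endogenous control
    (e.g. GAPDH or 18S rRNA) normalizes for RNA input and RT efficiency.
    Assumes both amplicons double every cycle (an idealization).
    """
    # Normalize the target to the endogenous control within each tissue.
    delta_sample = ct_target_sample - ct_control_sample
    delta_calibrator = ct_target_calibrator - ct_control_calibrator
    # Compare the normalized sample to the normalized calibrator.
    delta_delta = delta_sample - delta_calibrator
    # A lower Ct means more template, hence the negative exponent.
    return 2.0 ** (-delta_delta)
```

For instance, with hypothetical Ct values of 24 (target) and 18 (control) in a sample, and 27 and 18 in the calibrator, the call `relative_expression(24, 18, 27, 18)` gives an eight-fold higher relative level in the sample.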

The tissue distribution and the level of the target gene are evaluated for every
sample in normal and cancer tissues. Total RNA is extracted from normal tissues, cancer
tissues, and from cancers and the corresponding matched adjacent tissues. Subsequently,
first strand cDNA is prepared with reverse transcriptase and the polymerase chain
reaction is done using primers and Taqman probes specific to each target gene. The
results are analyzed using the ABI PRISM 7700 Sequence Detector. The absolute
numbers are relative levels of expression of the target gene in a particular tissue
compared to the calibrator tissue.
One of ordinary skill can design appropriate primers. The relative levels of
expression of the LSNA versus normal tissues and other cancer tissues can then be
determined. All the values are compared to normal tissue (calibrator). These RNA
samples are commercially available pools, originated by pooling samples of a particular
tissue from different individuals.

The relative levels of expression of the LSNA in pairs of matching samples (1
cancer and 1 normal/normal adjacent sample of tissue) may also be determined. All the
values are compared to normal tissue (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the normal adjacent sample for
that same tissue from the same individual.

In the analysis of matching samples, the LSNAs show a high degree of tissue
specificity for the tissue of interest. These results confirm the tissue specificity results
obtained with normal pooled samples.

Further, the level of mRNA expression in cancer samples and the isogenic normal
adjacent tissue from the same individual are compared. This comparison provides an
indication of specificity for the cancer stage (e.g. higher levels of mRNA expression in
the cancer sample compared to the normal adjacent).

Altogether, the high level of tissue specificity, plus the mRNA overexpression in
matching samples tested, are indicative of SEQ ID NO: 1 through 164 being diagnostic
markers for cancer.



Sequences     Sequence ID                 ddx QPCR code
DEX0134_10    DEX0291_18                  Lng261
DEX0134_17    DEX0291_29, DEX0291_30      Lng262
DEX0134_2     DEX0291_3, DEX0291_4        Lng259
DEX0134_24    DEX0291_40                  Lng260
DEX0134_77    DEX0291_129, DEX0291_130    Lng264
DEX0134_80    DEX0291_134, DEX0291_135    Lng256
DEX0134_96    DEX0291_159                 Lng228

DEX0134_10; DEX0291_18 (SEQ ID NO: 18); Lng261

Experiments are underway to test primers and probes for QPCR.

DEX0134_17; DEX0291_29 (SEQ ID NO: 29); DEX0291_30 (SEQ ID NO: 30); Lng262

Experiments are underway to test primers and probes for QPCR.

Primers Used for QPCR Expression Analysis in DEX0291_29



Primer Probe Oligo   Start From   End To   queryLength   sbjctDescript
lng262For            33           50       18            DEX0134_17
lng262Rev            122          101      22            DEX0134_17
lng262Probe          54           74       21            DEX0134_17



DEX0134_2; DEX0291_3 (SEQ ID NO: 3); DEX0291_4 (SEQ ID NO: 4); Lng259

Experiments are underway to test primers and probes for QPCR.

DEX0134_24; DEX0291_40 (SEQ ID NO: 40); Lng260

Experiments are underway to test primers and probes for QPCR.

DEX0134_77; DEX0291_129 (SEQ ID NO: 129); DEX0291_130 (SEQ ID NO: 130); Lng264

Experiments are underway to test primers and probes for QPCR.

DEX0134_80; DEX0291_134 (SEQ ID NO: 134); DEX0291_135 (SEQ ID NO: 135); Lng256

Experiments are underway to test primers and probes for QPCR.

DEX0134_96; DEX0291_159 (SEQ ID NO: 159); Lng228

Experiments are underway to test primers and probes for QPCR.



Primers Used for QPCR Expression Analysis 



Primer Probe Oligo   Start From   End To   queryLength   sbjctDescript
lng228For            814          837      24            DEX0134_96
lng228Rev            940          917      24            DEX0134_96
lng228Probe          906          877      30            DEX0134_96
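
The coordinates in the two primer tables above can be sanity-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes that "Start From" and "End To" are 1-based inclusive positions on the subject sequence, and that a descending pair (start greater than end) marks an oligo on the reverse strand. The `Oligo` type and helper names are hypothetical.

```python
from typing import NamedTuple

class Oligo(NamedTuple):
    name: str
    start: int   # "Start From" column
    end: int     # "End To" column
    length: int  # "queryLength" column

def span(oligo):
    """Number of subject positions covered, inclusive of both ends."""
    return abs(oligo.end - oligo.start) + 1

def strand(oligo):
    """'+' for ascending coordinates, '-' for descending (assumed reverse strand)."""
    return "+" if oligo.end >= oligo.start else "-"

def amplicon_length(forward, reverse):
    """Length of the product bounded by the outer ends of the two primers."""
    left = min(forward.start, forward.end)
    right = max(reverse.start, reverse.end)
    return right - left + 1

# Values copied from the tables for Lng262 (DEX0134_17) and Lng228 (DEX0134_96).
LNG262 = [Oligo("lng262For", 33, 50, 18),
          Oligo("lng262Rev", 122, 101, 22),
          Oligo("lng262Probe", 54, 74, 21)]
LNG228 = [Oligo("lng228For", 814, 837, 24),
          Oligo("lng228Rev", 940, 917, 24),
          Oligo("lng228Probe", 906, 877, 30)]

for oligo in LNG262 + LNG228:
    # Each reported queryLength should equal the span of its coordinates.
    assert span(oligo) == oligo.length, oligo.name
```

Under these assumptions every oligo length agrees with its coordinates, and `amplicon_length` gives the size of the product each primer pair would bound on its subject sequence.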




Example 3: Protein Expression 

The LSNA is amplified by polymerase chain reaction (PCR) and the amplified
DNA fragment encoding the LSNA is subcloned in pET-21d for expression in E. coli. In
addition to the LSNA coding sequence, codons for two amino acids, Met-Ala, flanking
the NH2-terminus of the coding sequence of LSNA, and six histidines, flanking the
COOH-terminus of the coding sequence of LSNA, are incorporated to serve as initiating
Met/restriction site and purification tag, respectively.

An over-expressed protein band of the appropriate molecular weight may be
observed on a Coomassie blue stained polyacrylamide gel. This protein band is
confirmed by Western blot analysis using a monoclonal antibody against the 6X Histidine
tag.

Large-scale purification of LSP was achieved using cell paste generated from
6-liter bacterial cultures, and purified using immobilized metal affinity chromatography
(IMAC). Soluble fractions that had been separated from total cell lysate were incubated
with a nickel chelating resin. The column was packed and washed with five column
volumes of wash buffer. LSP was eluted stepwise with imidazole buffers of various
concentrations.
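
As an illustration of the construct layout described in Example 3 (Met-Ala before the coding sequence, six histidines after it), the sketch below assembles a tagged open reading frame in silico and translates it as a check. It is a hypothetical sketch, not the cloning procedure of this example: the codon choices (ATG GCC for Met-Ala, CAC for His, TAA as stop) and the toy coding sequence in the usage note are assumptions, and in practice part of the tag may be supplied by the vector rather than by the insert.

```python
from itertools import product

# Standard genetic code, bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

def tagged_orf(cds, n_term="ATGGCC", his_codon="CAC", n_his=6, stop="TAA"):
    """Return Met-Ala + coding sequence + His tag + stop (assumed codons)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence must be a whole number of codons")
    # Drop a native stop codon so that the C-terminal tag is read through.
    if CODON_TABLE[cds[-3:]] == "*":
        cds = cds[:-3]
    return n_term + cds + his_codon * n_his + stop
```

For a toy coding sequence such as `GGTAAACTGTGA` (Gly-Lys-Leu-stop), `translate(tagged_orf("GGTAAACTGTGA"))` yields a product that begins with Met-Ala and ends with six histidines.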

Example 4: Protein Fusions 

Briefly, the human Fc portion of the IgG molecule can be PCR amplified, using
primers that span the 5' and 3' ends of the sequence described below. These primers also
should have convenient restriction enzyme sites that will facilitate cloning into an
expression vector, preferably a mammalian expression vector. For example, if pC4
(Accession No. 209646) is used, the human Fc portion can be ligated into the BamHI
cloning site. Note that the 3' BamHI site should be destroyed. Next, the vector containing
the human Fc portion is re-restricted with BamHI, linearizing the vector, and a
polynucleotide of the present invention, isolated by the PCR protocol described in
Example 2, is ligated into this BamHI site. Note that the polynucleotide is cloned without
a stop codon, otherwise a fusion protein will not be produced. If the naturally occurring
signal sequence is used to produce the secreted protein, pC4 does not need a second
signal peptide. Alternatively, if the naturally occurring signal sequence is not used, the
vector can be modified to include a heterologous signal sequence. See, e.g., WO
96/34891.

Example 5: Production of an Antibody from a Polypeptide 

In general, such procedures involve immunizing an animal (preferably a mouse)
with polypeptide or, more preferably, with a secreted polypeptide-expressing cell. Such
cells may be cultured in any suitable tissue culture medium; however, it is preferable to
culture cells in Earle's modified Eagle's medium supplemented with 10% fetal bovine
serum (inactivated at about 56°C), and supplemented with about 10 g/l of nonessential
amino acids, about 1,000 U/ml of penicillin, and about 100 μg/ml of streptomycin. The
splenocytes of such mice are extracted and fused with a suitable myeloma cell line. Any
suitable myeloma cell line may be employed in accordance with the present invention;
however, it is preferable to employ the parent myeloma cell line (SP20), available from
the ATCC. After fusion, the resulting hybridoma cells are selectively maintained in HAT
medium, and then cloned by limiting dilution as described by Wands et al.,
Gastroenterology 80: 225-232 (1981).

The hybridoma cells obtained through such a selection are then assayed to
identify clones which secrete antibodies capable of binding the polypeptide.
Alternatively, additional antibodies capable of binding to the polypeptide can be
produced in a two-step procedure using anti-idiotypic antibodies. Such a method makes
use of the fact that antibodies are themselves antigens, and therefore, it is possible to
obtain an antibody which binds to a second antibody. In accordance with this method,
protein-specific antibodies are used to immunize an animal, preferably a mouse. The
splenocytes of such an animal are then used to produce hybridoma cells, and the
hybridoma cells are screened to identify clones which produce an antibody whose ability
to bind to the protein-specific antibody can be blocked by the polypeptide. Such
antibodies comprise anti-idiotypic antibodies to the protein-specific antibody and can be
used to immunize an animal to induce formation of further protein-specific antibodies.

Using the Jameson-Wolf method, the following epitopes were predicted (Jameson and
Wolf, CABIOS, 4(1), 181-186, 1988, the contents of which are incorporated by
reference).

DEX0291_166 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
21-37        1.23     17

DEX0291_168 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
69-80        1.07     12

DEX0291_169 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
15-25        1.06     11

DEX0291_170 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
54-64        1.06     11

DEX0291_175 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
166-176      1.36     11
41-65        1.18     25
190-207      1.15     18
71-84        1.03     14

DEX0291_177 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
42-54        1.12     13

DEX0291_182 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
35-50        1.20     16

DEX0291_183 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
48-66        1.16     19
68-87        1.13     20

DEX0291_184 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
52-93        1.05     42

DEX0291_185 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
199-209      1.24     11
4-26         1.22     23
322-353      1.09     32
408-462      1.08     55
467-482      1.01     16
30-49        1.01     20

DEX0291_187 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
67-80        1.15     14

DEX0291_193 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
5-34         1.13     30

DEX0291_194 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
58-71        1.33     14
195-210      1.06     16
37-52        1.04     16

DEX0291_196 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
70-87        1.15     18

DEX0291_199 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
79-91        1.10     13

DEX0291_200 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
262-289      1.09     28
225-234      1.07     10
412-426      1.03     15

DEX0291_203 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
66-77        1.26     12

DEX0291_204 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
109-141      1.04     33
61-78        1.03     18
46-58        1.00     13

DEX0291_205 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
12-37        1.03     26

DEX0291_206 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
91-100       1.19     10

DEX0291_208 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
33-45        1.18     13
105-122      1.10     18
58-103       1.02     46

DEX0291_212 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
373-393      1.24     21
70-81        1.20     12
430-457      1.11     28
485-533      1.07     49
204-254      1.06     51
289-314      1.04     26
141-165      1.03     25
462-478      1.03     17
126-135      1.02     10
172-202      1.02     31
318-363      1.02     46
10-34        1.01     25
98-119       1.00     22

DEX0291_216 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
10-21        1.35     12

DEX0291_218 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
662-694      1.20     33
36-61        1.12     26
98-118       1.10     21
283-334      1.02     52
699-740      1.01     42

DEX0291_221 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
20-32        1.17     13

DEX0291_225 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
61-72        1.16     12
3-58         1.07     56

DEX0291_226 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
7-20         1.02     14

DEX0291_228 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
73-83        1.05     11
170-183      1.01     14

DEX0291_238 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
13-41        1.11     29

DEX0291_239 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
38-54        1.25     17

DEX0291_241 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
35-56        1.07     22

DEX0291_243 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
35-48        1.10     14

DEX0291_245 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
144-155      1.04     12

DEX0291_247 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
44-57        1.14     14
93-107       1.06     15
69-84        1.02     16

DEX0291_249 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
262-289      1.09     28
225-234      1.07     10
412-426      1.03     15

DEX0291_251 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
91-100       1.19     10

DEX0291_255 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
14-25        1.18     12

DEX0291_256 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
12-21        1.11     10

DEX0291_257 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
21-31        1.19     11

DEX0291_259 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
595-607      1.25     13
446-457      1.15     12
80-92        1.09     13
632-641      1.08     10
246-257      1.06     12
1054-1073    1.06     20
336-383      1.05     48
955-975      1.02     21
1477-1505    1.02     29
425-439      1.01     15

DEX0291_264 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
22-32        1.02     11

DEX0291_276 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
53-73        1.06     21

DEX0291_280 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
32-48        1.04     17

DEX0291_281 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
34-48        1.23     15

DEX0291_282 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
58-113       1.10     56

DEX0291_284 Antigenicity Index (Jameson-Wolf)
positions    AI avg   length
111-131      1.05     21
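
Each row of the tables above reports a stretch of residues, its average antigenic index (AI avg), and its length. The sketch below shows one way such rows could be derived from a per-residue score profile. It is an assumption-laden illustration: it takes the per-residue Jameson-Wolf scores as already computed, and the cut-offs (`threshold=1.0`, `min_length=10`) are guesses suggested by the smallest values that appear in the tables, not parameters stated in this specification.

```python
def antigenic_regions(scores, threshold=1.0, min_length=10):
    """Find contiguous runs of residues scoring at or above a threshold.

    scores: per-residue antigenic index values, first residue at index 0.
    Returns (start, end, average, length) tuples with 1-based inclusive
    positions, sorted by descending average as in the tables above.
    """
    scores = list(scores)
    regions = []
    run_start = None
    # A sentinel below any threshold closes a run that reaches the C-terminus.
    for i, score in enumerate(scores + [float("-inf")]):
        if score >= threshold and run_start is None:
            run_start = i
        elif score < threshold and run_start is not None:
            length = i - run_start
            if length >= min_length:
                average = sum(scores[run_start:i]) / length
                regions.append((run_start + 1, i, round(average, 2), length))
            run_start = None
    return sorted(regions, key=lambda region: -region[2])
```

With a made-up profile of five low-scoring residues, twelve residues at 1.2, three low-scoring residues and four residues at 1.5, only the twelve-residue run is reported (positions 6-17), because the final run is shorter than the assumed minimum length.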

The predicted helicities were as follows:

DEX0291_169 PredHel=2 Topology=o24-43i55-77o
DEX0291_170 PredHel=3 Topology=i29-48o63-82i94-116o
DEX0291_175 PredHel=5 Topology=i144-166o209-231i312-334o349-371i373-395o
DEX0291_183 PredHel=1 Topology=o20-42i
DEX0291_190 PredHel=1 Topology=i57-79o
DEX0291_193 PredHel=1 Topology=o33-52i
DEX0291_195 PredHel=1 Topology=i21-38o
DEX0291_209 PredHel=1 Topology=o36-58i
DEX0291_213 PredHel=1 Topology=o20-37i
DEX0291_218 PredHel=1 Topology=o616-638i
DEX0291_223 PredHel=2 Topology=o20-42i55-86o
DEX0291_229 PredHel=2 Topology=i5-22o27-45i
DEX0291_239 PredHel=1 Topology=i58-80o
DEX0291_247 PredHel=2 Topology=o20-42i205-227o
DEX0291_254 PredHel=1 Topology=i7-29o
DEX0291_259 PredHel=6 Topology=o761-780i828-850o865-883i896-918o983-1005i1035-1052o
DEX0291_260 PredHel=1 Topology=o55-77i
DEX0291_262 PredHel=1 Topology=o50-67i
DEX0291_270 PredHel=1 Topology=i20-39o
DEX0291_272 PredHel=1 Topology=o10-32i
DEX0291_279 PredHel=3 Topology=o42-64i99-121o126-148i
DEX0291_281 PredHel=1 Topology=i82-103o
DEX0291_283 PredHel=2 Topology=i13-35o55-77i
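
The topology strings above alternate a side marker with a residue range for each predicted helix. A minimal parser is sketched below under the assumption that `i` and `o` denote the inside and outside of the membrane, as in the short output of common transmembrane-helix predictors; the specification does not name the program used, and the function name is hypothetical.

```python
import re

def parse_topology(topology):
    """Split a string such as 'Topology=o24-43i55-77o' into its parts.

    Returns (n_terminal_side, helices): the side of the N-terminus
    ('i' or 'o', assumed to mean inside/outside) and a list of
    (start, end) residue ranges, one per predicted helix.
    """
    # Tolerate stray spaces, which are frequent in scanned copies.
    text = topology.replace(" ", "")
    if text.startswith("Topology="):
        text = text[len("Topology="):]
    if not re.fullmatch(r"[io](\d+-\d+[io])+", text):
        raise ValueError(f"unrecognized topology string: {topology!r}")
    helices = [(int(a), int(b)) for a, b in re.findall(r"(\d+)-(\d+)", text)]
    return text[0], helices
```

The number of ranges returned should equal the PredHel value listed for the same protein, which gives a simple consistency check on each row.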

Examples of post-translational modifications (PTMs) of the LSP of this invention
are listed below. In addition, antibodies that specifically bind such post-translational
modifications may be useful as a diagnostic or as a therapeutic. Using the ProSite database
(Bairoch et al., Nucleic Acids Res. 25(1):217-221 (1997), the contents of which are
incorporated by reference), the following PTMs were predicted for the LSPs of the
invention (http://npsa-pbil.ibc^.:fto 
recently accessed October 23, 2001). 
DEX0291_165 Pkc_Phospho_Site 8-10;
DEX0291_166 Asn_Glycosylation 29-32; Ck2_Phospho_Site 9-12; Pkc_Phospho_Site 25-27;31-33;
DEX0291_167 Myristyl 4-9;30-35;32-37; Prokar_Lipoprotein 24-34;
DEX0291_169 Ck2_Phospho_Site 20-23; Pkc_Phospho_Site 20-22;90-92;
DEX0291_170 Ck2_Phospho_Site 59-62; Pkc_Phospho_Site 2-4;9-11;17-19;21-23;59-61;
DEX0291_172 Ck2_Phospho_Site 29-32;34-37; Myristyl 10-15;
DEX0291_173 Pkc_Phospho_Site 33-35;
DEX0291_175 Asn_Glycosylation 191-194;396-399; Ck2_Phospho_Site 164-167;308-311;344-347;
            405-408;414-417; Leucine_Zipper 291-312;298-319; Myristyl 97-102;174-179;
            176-181;258-263;375-380;431-436; Pkc_Phospho_Site 304-306;308-310;
            Tyr_Phospho_Site 62-70;
DEX0291_176 Camp_Phospho_Site 13-16; Ck2_Phospho_Site 5-8; Pkc_Phospho_Site 9-11;16-18;
DEX0291_177 Myristyl 20-25;52-57; Pkc_Phospho_Site 60-62;86-88;
DEX0291_178 Ck2_Phospho_Site 25-28;33-36;46-49; Pkc_Phospho_Site 15-17;50-52;
DEX0291_179 Ck2_Phospho_Site 58-61;80-83;84-87; Pkc_Phospho_Site 28-30;
DEX0291_181 Ck2_Phospho_Site 11-14;46-49; Pkc_Phospho_Site 2-4;46-48;
DEX0291_182 Ck2_Phospho_Site 16-19;30-33; Myristyl 20-25; Pkc_Phospho_Site 47-49;
DEX0291_183 Asn_Glycosylation 80-83; Ck2_Phospho_Site 82-85; Myristyl 26-31;30-35;63-68;
            Pkc_Phospho_Site 10-12;82-84;
DEX0291_184 Myristyl 16-21;40-45;44-49; Pkc_Phospho_Site 59-61;77-79;
DEX0291_185 Amidation 15-18;88-91; Asn_Glycosylation 186-189;396-399; Camp_Phospho_Site
            476-479; Ck2_Phospho_Site 31-34;74-77;110-113;198-201;423-426;
            Fibrin_Ag_C_Domain 430-442; Myristyl 27-32;40-45;43-48;169-174;194-199;
            362-367;447-452;458-463;470-475; Pkc_Phospho_Site 336-338;392-394;411-413;
            474-476;479-481; Tyr_Phospho_Site 202-208;288-295;
DEX0291_186 Pkc_Phospho_Site 29-31;
DEX0291_187 Amidation 105-108; Bzip_Basic 125-139;126-139; Ck2_Phospho_Site 51-54;115-118;
            Glycosaminoglycan 23-26; Myristyl 2-7;37-42;96-101;102-107; Pkc_Phospho_Site
            7-9;144-146;
DEX0291_188 Pkc_Phospho_Site 10-12;
DEX0291_189 Ck2_Phospho_Site 19-22; Pkc_Phospho_Site 3-5;41-43; Tyr_Phospho_Site 5-13;
DEX0291_190 Myristyl 49-54; Pkc_Phospho_Site 24-26; Prokar_Lipoprotein 38-48;
DEX0291_191 Ck2_Phospho_Site 35-38;95-98; Myristyl 6-11;
DEX0291_192 Myristyl 22-27; Pkc_Phospho_Site 43-45;
DEX0291_193 Camp_Phospho_Site 15-18; Myristyl 24-29;27-32; Pkc_Phospho_Site 47-49;55-57;
DEX0291_194 Asn_Glycosylation 30-33; Camp_Phospho_Site 62-65;120-123; Ck2_Phospho_Site
            65-68;79-82;131-134;136-139;138-141;152-155; Myristyl 69-74;73-78;100-105;
            198-203; Pkc_Phospho_Site 79-81;133-135;193-195;
DEX0291_196 Asn_Glycosylation 32-35; Ck2_Phospho_Site 56-59;
DEX0291_197 Asn_Glycosylation 36-39; Myristyl 12-17; Pkc_Phospho_Site 20-22;
DEX0291_198 Ck2_Phospho_Site 73-76; Myristyl 12-17;17-22;66-71; Pkc_Phospho_Site 91-93;
DEX0291_199 Asn_Glycosylation 125-128; Camp_Phospho_Site 70-73; Ck2_Phospho_Site 118-121;
            Myristyl 61-66;131-136; Pkc_Phospho_Site 32-34;68-70;80-82;89-91;140-142;
            153-155;
DEX0291_200 Amidation 2-5; Asn_Glycosylation 74-77;127-130; Ck2_Phospho_Site 76-79;230-233;
            242-245;262-265;423-426;566-569; Leucine_Zipper 95-116; Myristyl 34-39;
            419-424;499-504;536-541; Pkc_Phospho_Site 276-278;380-382;387-389;442-444;
            591-593; Tyr_Phospho_Site 561-568;562-568;
DEX0291_202 Myristyl 5-10;
DEX0291_203 Ck2_Phospho_Site 31-34;48-51; Myristyl 4-9; Pkc_Phospho_Site 84-86;
            Prokar_Lipoprotein 39-49;
DEX0291_204 Asn_Glycosylation 64-67; Ck2_Phospho_Site 33-36;166-169;188-191;234-237;236-239;
            Pkc_Phospho_Site 66-68;140-142;243-245;249-251; Tyr_Phospho_Site 124-132;
DEX0291_205 Camp_Phospho_Site 22-25; Myristyl 70-75; Pkc_Phospho_Site 13-15;
DEX0291_206 Ck2_Phospho_Site 66-69;96-99; Glycosaminoglycan 50-53; Myristyl 47-52;49-54;
            53-58;62-67; Pkc_Phospho_Site 12-14;132-134;141-143;192-194;210-212;
            Prokar_Lipoprotein 159-169;
DEX0291_207 Asn_Glycosylation 70-73; Ck2_Phospho_Site 15-18;81-84;
DEX0291_208 Asn_Glycosylation 75-78; Camp_Phospho_Site 84-87; Ck2_Phospho_Site 36-39;78-81;
            Pkc_Phospho_Site 111-113;
DEX0291_209 Asn_Glycosylation 30-33; Myristyl 5-10; Pkc_Phospho_Site 26-28;
DEX0291_210 Asn_Glycosylation 56-59;90-93;173-176; Ck2_Phospho_Site 24-27;58-61;92-95;
            138-141;148-151;270-273; Myristyl 45-50;49-54;306-311;312-317;354-359;
            397-402;426-431;433-438; Peroxidase^ 252-263; Pkc_Phospho_Site 70-72;
            138-140;145-147;163-165;195-197;207-209;252-254;263-265;275-277;325-327;
            359-361;402-404;444-446; Wd_Repeats 267-281;351-365;
DEX0291_211 Myristyl 5-10;
DEX0291_212 Amidation 102-105;521-524; Asn_Glycosylation 33-36;62-65;201-204;230-233;
            313-316;342-345;343-346;454-457;536-539; Ck2_Phospho_Site 194-197;204-207;
            388-391;511-514; Myristyl 183-188;220-225; Pkc_Phospho_Site 6-8;143-145;
            177-179;286-288;345-347;398-400;404-406;521-523; Zinc_Finger_C2h2 25-45;
            53-73;81-101;109-129;137-157;165-185;193-213;221-241;249-269;277-297;
            305-325;333-353;361-381;389-409;417-437;445-465;473-493;
DEX0291_213 Myristyl 24-29;
DEX0291_214 Pkc_Phospho_Site 9-11;
DEX0291_215 Myristyl 5-10;8-13;11-16;
DEX0291_216 Camp_Phospho_Site 18-21; Pkc_Phospho_Site 17-19;60-62;
DEX0291_217 Pkc_Phospho_Site 30-32;
DEX0291_218 Asn_Glycosylation 72-75;261-264;370-373;474-477;516-519; Camp_Phospho_Site
            224-227;366-369; Ck2_Phospho_Site 36-39;180-183;253-256;333-336;380-383;
            457-460;778-781; Myristyl 177-182;217-222;266-271;319-324;368-373;381-386;
            384-389;393-398;482-487;575-580;585-590;649-654;731-736;732-737;
            Pkc_Phospho_Site 50-52;151-153;315-317;475-477;507-509;513-515;637-639;
            653-655;694-696; Tyr_Phospho_Site 193-200;290-296;681-688;
DEX0291_219 Ck2_Phospho_Site 39-42; Myristyl 47-52;48-53; Pkc_Phospho_Site 39-41;
DEX0291_220 Asn_Glycosylation 20-23; Ck2_Phospho_Site 123-126; Glycosaminoglycan 72-75;
            Myristyl 30-35;75-80; Pkc_Phospho_Site 123-125; Prokar_Lipoprotein 70-80;
            Tyr_Phospho_Site 107-114;
DEX0291_221 Amidation 29-32; Asn_Glycosylation 23-26; Ck2_Phospho_Site 36-39;
            Pkc_Phospho_Site 24-26;36-38;
DEX0291_222 Ck2_Phospho_Site 78-81; Myristyl 34-39;36-41; Pkc_Phospho_Site 96-98;
DEX0291_223 Myristyl 27-32;31-36;33-38;60-65;64-69;66-71;67-72;70-75;77-82;84-89;93-98;
            95-100;98-103; Prokar_Lipoprotein 27-37;58-68;
DEX0291_224 Asn_Glycosylation 30-33;181-184; Camp_Phospho_Site 37-40; Ck2_Phospho_Site
            7-10; Pkc_Phospho_Site 83-85;
DEX0291_225 Asn_Glycosylation 24-27; Ck2_Phospho_Site 3-6;72-75; Myristyl 20-25;
            Pkc_Phospho_Site 48-50;
DEX0291_226 Pkc_Phospho_Site 19-21;
DEX0291_227 Leucine_Zipper 10-31; Pkc_Phospho_Site 3-5;
DEX0291_228 Asn_Glycosylation 182-185; Camp_Phospho_Site 27-30; Ck2_Phospho_Site 21-24;
            41-44;78-81;98-101;112-115; Myristyl 53-58;96-101;103-108; Pkc_Phospho_Site
            9-11;58-60; Tyr_Phospho_Site 76-82;
DEX0291_229 Myristyl 13-18;
DEX0291_231 Pkc_Phospho_Site 16-18;
DEX0291_233 Ck2_Phospho_Site 79-82; Myristyl 22-27;47-52; Pkc_Phospho_Site 15-17;97-99;
            Prokar_Lipoprotein 42-52;
DEX0291_235 Asn_Glycosylation 44-47; Myristyl 30-35;
DEX0291_236 Ck2_Phospho_Site 40-43; Pkc_Phospho_Site 40-42;
DEX0291_237 Ck2_Phospho_Site 13-16; Myristyl 19-24;23-28;54-59; Pkc_Phospho_Site 32-34;95-97;
DEX0291_238 Amidation 14-17; Pkc_Phospho_Site 14-16;
DEX0291_239 Ck2_Phospho_Site 30-33; Myristyl 52-57;53-58;71-76; Pkc_Phospho_Site 49-51;
DEX0291_240 Amidation 119-122; Camp_Phospho_Site 121-124; Ck2_Phospho_Site 124-127;181-184;
            Glycosaminoglycan 115-118; Myristyl 5-10;13-18;14-19;16-21;18-23;19-24;
            20-25;21-26;22-27;23-28;33-38;36-41;37-42;55-60;90-95;103-108;105-110;
            106-111;108-113;116-121;136-141;170-175; Pkc_Phospho_Site 9-11;
DEX0291_241 Asn_Glycosylation 19-22;53-56; Ck2_Phospho_Site 4-7;80-83; Myristyl 52-57;
            Pkc_Phospho_Site 103-105;
DEX0291_242 Leucine_Zipper 10-31;
DEX0291_243 Myristyl 10-15;57-62;
DEX0291_244 Rgd 15-17;
DEX0291_245 Ck2_Phospho_Site 144-147; Myristyl 5-10;165-170;
DEX0291_247 Asn_Glycosylation 73-76;101-104;167-170; Camp_Phospho_Site 230-233;
            Ck2_Phospho_Site 159-162;194-197; Glycosaminoglycan 33-36; Leucine_Zipper
            198-219; Myristyl 2-7;34-39;74-79;87-92;112-117;116-121;119-124;149-154;
            164-169;186-191;217-222; Pkc_Phospho_Site 43-45;77-79;129-131;134-136;
            171-173;
DEX0291_248 Ck2_Phospho_Site 34-37; Myristyl 43-48; Pkc_Phospho_Site 58-60; Tyr_Phospho_Site
            16-24;
DEX0291_249 Amidation 2-5; Asn_Glycosylation 74-77;127-130; Ck2_Phospho_Site 76-79;230-233;
            242-245;262-265;423-426;566-569; Leucine_Zipper 95-116; Myristyl 34-39;
            419-424;499-504;536-541; Pkc_Phospho_Site 276-278;380-382;387-389;442-444;
            591-593; Tyr_Phospho_Site 561-568;562-568;
DEX0291_251 Ck2_Phospho_Site 66-69;96-99; Glycosaminoglycan 50-53; Myristyl 47-52;49-54;
            53-58;62-67; Pkc_Phospho_Site 12-14;132-134;141-143;192-194;210-212;
            Prokar_Lipoprotein 159-169;
DEX0291_252 Camp_Phospho_Site 27-30; Myristyl 15-20; Pkc_Phospho_Site 5-7;
DEX0291_253 Asn_Glycosylation 55-58;
DEX0291_254 Pkc_Phospho_Site 9-11;
DEX0291_255 Camp_Phospho_Site 20-23; Myristyl 67-72;
DEX0291_256 Camp_Phospho_Site 15-18; Ck2_Phospho_Site 23-26; Pkc_Phospho_Site 18-20;30-32;
DEX0291_257 Asn_Glycosylation 2-5;27-30; Myristyl 15-20; Pkc_Phospho_Site 4-6;
DEX0291_258 Ck2_Phospho_Site 53-56;
DEX0291_259 Amidation 362-365;513-516;968-971; Asn_Glycosylation 133-136;144-147;233-236;
            298-301;478-481;601-604;635-638;638-641;830-833; Ck2_Phospho_Site 9-12;
            235-238;300-303;343-346;459-462;587-590;698-701;706-709;788-791; Myristyl
            35-40;53-58;68-73;69-74;102-107;211-216;229-234;296-301;473-478;728-733;
            747-752; Pkc_Phospho_Site 86-88;212-214;235-237;343-345;353-355;480-482;
            617-619;706-708;729-731;818-820;925-927; Prokar_Lipoprotein 978-988;
            Tyr_Phospho_Site 697-704;891-898;
DEX0291_260 Ck2_Phospho_Site 17-20;49-52;77-80; Pkc_Phospho_Site 45-47;
DEX0291_261 Pkc_Phospho_Site 5-7;32-34;
DEX0291_262 Asn_Glycosylation 76-79; Ck2_Phospho_Site 16-19;45-48; Myristyl 6-11;9-14;
            56-61;58-63; Pkc_Phospho_Site 25-27;84-86;
DEX0291_264 Asn_Glycosylation 19-22; Camp_Phospho_Site 31-34; Ck2_Phospho_Site 40-43;74-77;
            Myristyl 37-42;90-95;
DEX0291_267 Asn_Glycosylation 56-59;98-101; Myristyl 66-71;
DEX0291_268 Pkc_Phospho_Site 33-35;
DEX0291_269 Ck2_Phospho_Site 8-11;
DEX0291_270 Asn_Glycosylation 3-6; Ck2_Phospho_Site 6-9;
DEX0291_271 Myristyl 10-15;
DEX0291_272 Myristyl 9-14; Pkc_Phospho_Site 3-5;
DEX0291_273 Ck2_Phospho_Site 29-32; Pkc_Phospho_Site 29-31;
DEX0291_274 Ck2_Phospho_Site 215-218; Pkc_Phospho_Site 184-186;
DEX0291_275 Myristyl 2-7; Pkc_Phospho_Site 15-17;
DEX0291_276 Ck2_Phospho_Site 5-8; Pkc_Phospho_Site 21-23;33-35;44-46;65-67;
DEX0291_279 Myristyl 78-83;122-127; Pkc_Phospho_Site 25-27; Prokar_Lipoprotein 49-59;
DEX0291_280 Asn_Glycosylation 11-14; Camp_Phospho_Site 33-36;34-37; Pkc_Phospho_Site 32-34;
            37-39;51-53;
DEX0291_281 Ck2_Phospho_Site 35-38;44-47;
DEX0291_282 Camp_Phospho_Site 109-112;123-126;161-164; Ck2_Phospho_Site 98-101;141-144;
            150-153;151-154; Myristyl 38-43; Pkc_Phospho_Site 23-25;71-73;78-80;98-100;
            133-135;141-143;150-152;155-157;156-158;
DEX0291_283 Asn_Glycosylation 2-5; Myristyl 19-24; Pkc_Phospho_Site 51-53;
DEX0291_284 Ck2_Phospho_Site 5-8;14-17;25-28;102-105;137-140;148-151; Myristyl 60-65;91-96;
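
Each entry in the PTM listing follows one pattern: a protein identifier, then motif names, each followed by semicolon-separated residue ranges. The sketch below parses an entry of that shape into a dictionary. It is illustrative only and assumes a clean, single-line entry; the function name is hypothetical, and the scanning artifacts found in printed copies of the listing would need to be corrected first.

```python
def parse_ptm_entry(entry):
    """Parse 'ID Motif a-b;c-d; Motif e-f;' into (ID, {motif: [(a, b), ...]})."""
    protein, _, rest = entry.strip().partition(" ")
    sites = {}
    motif = None
    # Semicolons only separate ranges, so they can be treated as whitespace.
    for token in rest.replace(";", " ").split():
        if token[0].isalpha():
            # A token starting with a letter opens a new motif name.
            motif = token
            sites.setdefault(motif, [])
        else:
            start, end = token.split("-")
            sites[motif].append((int(start), int(end)))
    return protein, sites
```

Applied to the entry for DEX0291_170, this returns one Ck2_Phospho_Site range and five Pkc_Phospho_Site ranges, which can then be compared against epitope or helix positions for the same protein.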

Example 6: Method of Determining Alterations in a Gene Corresponding to a
Polynucleotide

RNA is isolated from individual patients or from a family of individuals that have
a phenotype of interest. cDNA is then generated from these RNA samples using
protocols known in the art. See, Sambrook (2001), supra. The cDNA is then used as a
template for PCR, employing primers surrounding regions of interest in SEQ ID NO: 1
through 164. Suggested PCR conditions consist of 35 cycles at 95°C for 30 seconds;
60-120 seconds at 52-58°C; and 60-120 seconds at 70°C, using buffer solutions
described in Sidransky et al., Science 252(5006): 706-9 (1991). See also Sidransky et al.,
Science 278(5340): 1054-9 (1997).

PCR products are then sequenced using primers labeled at their 5' end with T4
polynucleotide kinase, employing SequiTherm Polymerase (Epicentre Technologies).
The intron-exon borders of selected exons are also determined and genomic PCR products
analyzed to confirm the results. PCR products harboring suspected mutations are then
cloned and sequenced to validate the results of the direct sequencing. PCR products are
cloned into T-tailed vectors as described in Holton et al., Nucleic Acids Res., 19: 1156
(1991) and sequenced with T7 polymerase (United States Biochemical). Affected
individuals are identified by mutations not present in unaffected individuals.

Genomic rearrangements may also be determined. Genomic clones are
nick-translated with digoxigenin deoxyuridine 5'-triphosphate (Boehringer Mannheim),
and FISH is performed as described in Johnson et al., Methods Cell Biol. 35: 73-99
(1991). Hybridization with the labeled probe is carried out using a vast excess of human
cot-1 DNA for specific hybridization to the corresponding genomic locus.

Chromosomes are counterstained with 4',6-diamidino-2-phenylindole and propidium
iodide, producing a combination of C- and R-bands. Aligned images for precise mapping
are obtained using a triple-band filter set (Chroma Technology, Brattleboro, VT) in
combination with a cooled charge-coupled device camera (Photometrics, Tucson, AZ)
and variable excitation wavelength filters. Id. Image collection, analysis and
chromosomal fractional length measurements are performed using the ISee Graphical
Program System (Inovision Corporation, Durham, NC). Chromosome alterations of the
genomic region hybridized by the probe are identified as insertions, deletions, and
translocations. These alterations are used as a diagnostic marker for an associated
disease.

Example 7: Method of Detecting Abnormal Levels of a Polypeptide in a Biological 
Sample 

Antibody-sandwich ELISAs are used to detect polypeptides in a sample,
preferably a biological sample. Wells of a microtiter plate are coated with specific
antibodies, at a final concentration of 0.2 to 10 µg/ml. The antibodies are either
monoclonal or polyclonal and are produced by the method described above. The wells
are blocked so that non-specific binding of the polypeptide to the well is reduced. The
coated wells are then incubated for more than 2 hours at room temperature with a sample
containing the polypeptide. Preferably, serial dilutions of the sample should be used to
validate results. The plates are then washed three times with deionized or distilled water
to remove unbound polypeptide. Next, 50 µl of specific antibody-alkaline phosphatase
conjugate, at a concentration of 25-400 ng, is added and incubated for 2 hours at room
temperature. The plates are again washed three times with deionized or distilled water to
remove unbound conjugate. 75 µl of 4-methylumbelliferyl phosphate (MUP) or
p-nitrophenyl phosphate (NPP) substrate solution is added to each well and incubated for
1 hour at room temperature.

The reaction is measured by a microtiter plate reader. A standard curve is
prepared, using serial dilutions of a control sample, and polypeptide concentrations are
plotted on the X-axis (log scale) and fluorescence or absorbance on the Y-axis (linear
scale). The concentration of the polypeptide in the sample is calculated using the
standard curve.
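
By way of illustration only, the standard-curve calculation described above may be sketched in Python. The function name, the example standards and the use of piecewise-linear interpolation between adjacent standards (log-scale concentration against linear signal) are assumptions made for this sketch; the specification does not prescribe a particular curve-fitting method.

```python
import math


def concentration_from_signal(standards, signal):
    """Estimate a sample concentration from a standard curve.

    standards: list of (concentration, signal) pairs from the serial
               dilutions of the control sample (hypothetical data).
               The signal is assumed to increase strictly with concentration.
    signal:    fluorescence or absorbance measured for the sample.
    """
    # X-axis is log10(concentration); Y-axis is the linear signal.
    points = sorted((math.log10(conc), sig) for conc, sig in standards)
    for (x0, y0), (x1, y1) in zip(points, points[1:]):
        if y0 <= signal <= y1:
            fraction = (signal - y0) / (y1 - y0)
            return 10 ** (x0 + fraction * (x1 - x0))
    raise ValueError("signal lies outside the range of the standard curve")


# Hypothetical standards: (concentration in ng/ml, absorbance)
example_standards = [(1.0, 0.1), (10.0, 0.5), (100.0, 0.9)]
estimate = concentration_from_signal(example_standards, 0.3)
```

A sample whose signal falls outside the range of the standards raises an error in this sketch, which corresponds to the practice of re-assaying the sample at a different dilution.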

Example 8: Formulating a Polypeptide 

The secreted polypeptide composition will be formulated and dosed in a fashion
consistent with good medical practice, taking into account the clinical condition of the
individual patient (especially the side effects of treatment with the secreted polypeptide
alone), the site of delivery, the method of administration, the scheduling of
administration, and other factors known to practitioners. The "effective amount" for
purposes herein is thus determined by such considerations.

As a general proposition, the total pharmaceutically effective amount of secreted
polypeptide administered parenterally per dose will be in the range of about 1 µg/kg/day
to 10 mg/kg/day of patient body weight, although, as noted above, this will be subject to
therapeutic discretion. More preferably, this dose is at least 0.01 mg/kg/day, and most
preferably for humans between about 0.01 and 1 mg/kg/day for the hormone. If given
continuously, the secreted polypeptide is typically administered at a dose rate of about 1
µg/kg/hour to about 50 mg/kg/hour, either by 1-4 injections per day or by continuous
subcutaneous infusions, for example, using a mini-pump. An intravenous bag solution
may also be employed. The length of treatment needed to observe changes and the
interval following treatment for responses to occur appear to vary depending on the
desired effect.
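
The per-kilogram ranges above translate into an absolute daily amount by simple multiplication with body weight. The following Python sketch shows only that arithmetic; the function name and the 70 kg example are assumptions for illustration, and the sketch is not dosing guidance.

```python
# Broad range stated above, expressed in mg/kg/day (1 µg = 0.001 mg).
MIN_DOSE_MG_PER_KG_DAY = 0.001
MAX_DOSE_MG_PER_KG_DAY = 10.0


def total_daily_dose_mg(weight_kg, dose_mg_per_kg_day):
    """Return the absolute daily amount (mg) for a given body weight."""
    if not MIN_DOSE_MG_PER_KG_DAY <= dose_mg_per_kg_day <= MAX_DOSE_MG_PER_KG_DAY:
        raise ValueError("dose is outside the range of about 1 µg/kg/day to 10 mg/kg/day")
    return weight_kg * dose_mg_per_kg_day


# Hypothetical 70 kg patient at the lower end of the most preferred range.
example_amount_mg = total_daily_dose_mg(70, 0.01)  # about 0.7 mg per day
```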

Pharmaceutical compositions containing the secreted protein of the invention are
administered orally, rectally, parenterally, intracisternally, intravaginally,
intraperitoneally, topically (as by powders, ointments, gels, drops or transdermal patch),
buccally, or as an oral or nasal spray. "Pharmaceutically acceptable carrier" refers to a
non-toxic solid, semisolid or liquid filler, diluent, encapsulating material or formulation
auxiliary of any type. The term "parenteral" as used herein refers to modes of
administration which include intravenous, intramuscular, intraperitoneal, intrasternal,
subcutaneous and intraarticular injection and infusion.

The secreted polypeptide is also suitably administered by sustained-release
systems. Suitable examples of sustained-release compositions include semipermeable
polymer matrices in the form of shaped articles, e.g., films, or microcapsules.
Sustained-release matrices include polylactides (U.S. Pat. No. 3,773,919, EP 58,481),
copolymers of L-glutamic acid and gamma-ethyl-L-glutamate (Sidman, U. et al.,
Biopolymers 22: 547-556 (1983)), poly (2-hydroxyethyl methacrylate) (R. Langer et al.,
J. Biomed. Mater. Res. 15: 167-277 (1981), and R. Langer, Chem. Tech. 12: 98-105
(1982)), ethylene vinyl acetate (R. Langer et al.) or poly-D-(-)-3-hydroxybutyric acid
(EP 133,988). Sustained-release compositions also include liposomally entrapped
polypeptides. Liposomes containing the secreted polypeptide are prepared by methods
known per se: DE Epstein et al., Proc. Natl. Acad. Sci. USA 82: 3688-3692 (1985);
Hwang et al., Proc. Natl. Acad. Sci. USA 77: 4030-4034 (1980); EP 52,322; EP 36,676;
EP 88,046; EP 143,949; EP 142,641; Japanese Pat. Appl. 83-118008; U.S. Pat. Nos.
4,485,045 and 4,544,545; and EP 102,324. Ordinarily, the liposomes are of the small
(about 200-800 Angstroms) unilamellar type in which the lipid content is greater than
about 30 mol. percent cholesterol, the selected proportion being adjusted for the optimal
secreted polypeptide therapy.

For parenteral administration, in one embodiment, the secreted polypeptide is
formulated generally by mixing it at the desired degree of purity, in a unit dosage
injectable form (solution, suspension, or emulsion), with a pharmaceutically acceptable
carrier, i.e., one that is non-toxic to recipients at the dosages and concentrations
employed and is compatible with other ingredients of the formulation.

For example, the formulation preferably does not include oxidizing agents and
other compounds that are known to be deleterious to polypeptides. Generally, the
formulations are prepared by contacting the polypeptide uniformly and intimately with
liquid carriers or finely divided solid carriers or both. Then, if necessary, the product is
shaped into the desired formulation. Preferably the carrier is a parenteral carrier, more
preferably a solution that is isotonic with the blood of the recipient. Examples of such
carrier vehicles include water, saline, Ringer's solution, and dextrose solution.
Non-aqueous vehicles such as fixed oils and ethyl oleate are also useful herein, as well as
liposomes.

The carrier suitably contains minor amounts of additives such as substances that
enhance isotonicity and chemical stability. Such materials are non-toxic to recipients at
the dosages and concentrations employed, and include buffers such as phosphate, citrate,
succinate, acetic acid, and other organic acids or their salts; antioxidants such as ascorbic
acid; low molecular weight (less than about ten residues) polypeptides, e.g., polyarginine
or tripeptides; proteins, such as serum albumin, gelatin, or immunoglobulins; hydrophilic
polymers such as polyvinylpyrrolidone; amino acids, such as glycine, glutamic acid,
aspartic acid, or arginine; monosaccharides, disaccharides, and other carbohydrates
including cellulose or its derivatives, glucose, mannose, or dextrins; chelating agents such
as EDTA; sugar alcohols such as mannitol or sorbitol; counterions such as sodium;
and/or nonionic surfactants such as polysorbates, poloxamers, or PEG.

The secreted polypeptide is typically formulated in such vehicles at a
concentration of about 0.1 mg/ml to 100 mg/ml, preferably 1-10 mg/ml, at a pH of about
3 to 8. It will be understood that the use of certain of the foregoing excipients, carriers, or
stabilizers will result in the formation of polypeptide salts.

Any polypeptide to be used for therapeutic administration can be sterile. Sterility
is readily accomplished by filtration through sterile filtration membranes (e.g., 0.2
micron membranes). Therapeutic polypeptide compositions generally are placed into a
container having a sterile access port, for example, an intravenous solution bag or vial
having a stopper pierceable by a hypodermic injection needle.

Polypeptides ordinarily will be stored in unit or multi-dose containers, for
example, sealed ampules or vials, as an aqueous solution or as a lyophilized formulation
for reconstitution. As an example of a lyophilized formulation, 10-ml vials are filled with
5 ml of sterile-filtered 1% (w/v) aqueous polypeptide solution, and the resulting mixture
is lyophilized. The infusion solution is prepared by reconstituting the lyophilized
polypeptide using bacteriostatic Water-for-Injection.
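
The amount of polypeptide in each vial follows from the fill volume and the concentration: 1% (w/v) is 1 g per 100 ml, i.e. 10 mg/ml, so a 5 ml fill contains 50 mg. A minimal Python sketch of this arithmetic is given below; the function names and the 10 ml reconstitution volume are assumptions for illustration and are not taken from the specification.

```python
def polypeptide_mass_mg(fill_volume_ml, percent_w_v):
    """Mass of solute (mg) in a fill volume of a % (w/v) solution.

    1% (w/v) = 1 g per 100 ml = 10 mg per ml.
    """
    return fill_volume_ml * percent_w_v * 10.0


def reconstituted_conc_mg_per_ml(mass_mg, diluent_volume_ml):
    """Concentration after reconstituting the lyophilized cake (volume assumed)."""
    return mass_mg / diluent_volume_ml


vial_mass_mg = polypeptide_mass_mg(5.0, 1.0)                   # 50 mg per vial
example_conc = reconstituted_conc_mg_per_ml(vial_mass_mg, 10)  # 5 mg/ml if 10 ml is used
```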

The invention also provides a pharmaceutical pack or kit comprising one or more
containers filled with one or more of the ingredients of the pharmaceutical compositions
of the invention. Associated with such container(s) can be a notice in the form
prescribed by a governmental agency regulating the manufacture, use or sale of
pharmaceuticals or biological products, which notice reflects approval by the agency of
manufacture, use or sale for human administration. In addition, the polypeptides of the
present invention may be employed in conjunction with other therapeutic compounds.

Example 9: Method of Treating Decreased Levels of the Polypeptide

It will be appreciated that conditions caused by a decrease in the standard or
normal expression level of a secreted protein in an individual can be treated by
administering the polypeptide of the present invention, preferably in the secreted form.
Thus, the invention also provides a method of treatment of an individual in need of an
increased level of the polypeptide comprising administering to such an individual a
pharmaceutical composition comprising an amount of the polypeptide to increase the
activity level of the polypeptide in such an individual.

For example, a patient with decreased levels of a polypeptide receives a daily
dose of 0.1-100 µg/kg of the polypeptide for six consecutive days. Preferably, the
polypeptide is in the secreted form. The exact details of the dosing scheme, based on
administration and formulation, are provided above.

Example 10: Method of Treating Increased Levels of the Polypeptide 

Antisense technology is used to inhibit production of a polypeptide of the present
invention. This technology is one example of a method of decreasing levels of a
polypeptide, preferably a secreted form, due to a variety of etiologies, such as cancer.

For example, a patient diagnosed with abnormally increased levels of a
polypeptide is administered antisense polynucleotides intravenously at 0.5, 1.0, 1.5, 2.0
and 3.0 mg/kg/day for 21 days. This treatment is repeated after a 7-day rest period if the
treatment was well tolerated. The formulation of the antisense polynucleotide is provided
above.

Example 11: Method of Treatment Using Gene Therapy 

One method of gene therapy transplants fibroblasts, which are capable of
expressing a polypeptide, onto a patient. Generally, fibroblasts are obtained from a
subject by skin biopsy. The resulting tissue is placed in tissue-culture medium and
separated into small pieces. Small chunks of the tissue are placed on a wet surface of a
tissue culture flask; approximately ten pieces are placed in each flask. The flask is turned
upside down, closed tight and left at room temperature overnight. After 24 hours at room
temperature, the flask is inverted, the chunks of tissue remain fixed to the bottom of
the flask, and fresh media (e.g., Ham's F12 media, with 10% FBS, penicillin and
streptomycin) is added. The flasks are then incubated at 37°C for approximately one
week.

At this time, fresh media is added and subsequently changed every several days.
After an additional two weeks in culture, a monolayer of fibroblasts emerges. The
monolayer is trypsinized and scaled into larger flasks. pMV-7 (Kirschmeier, P. T. et al.,
DNA, 7: 219-25 (1988)), flanked by the long terminal repeats of the Moloney murine
sarcoma virus, is digested with EcoRI and HindIII and subsequently treated with calf
intestinal phosphatase. The linear vector is fractionated on agarose gel and purified, using
glass beads.

The cDNA encoding a polypeptide of the present invention can be amplified
using PCR primers which correspond to the 5' and 3' end sequences respectively as set
forth in Example 1. Preferably, the 5' primer contains an EcoRI site and the 3' primer
includes a HindIII site. Equal quantities of the Moloney murine sarcoma virus linear
backbone and the amplified EcoRI and HindIII fragment are added together, in the
presence of T4 DNA ligase. The resulting mixture is maintained under conditions
appropriate for ligation of the two fragments. The ligation mixture is then used to
transform bacteria HB101, which are then plated onto agar containing kanamycin for the
purpose of confirming that the vector has the gene of interest properly inserted.
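
The primer design described above, in which an EcoRI site is appended to the 5' primer and a HindIII site to the 3' primer, may be sketched as follows. The recognition sequences GAATTC (EcoRI) and AAGCTT (HindIII) are well established; the function names, the annealing length, the example cDNA and the check for internal sites are assumptions made for this illustration and are not taken from Example 1.

```python
ECORI_SITE = "GAATTC"    # EcoRI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(cdna, anneal_len=18):
    """Return (5' primer, 3' primer), both written 5'->3'.

    cdna is the coding-strand sequence to amplify (hypothetical input).
    """
    cdna = cdna.upper()
    # An internal site would be cut during cloning, so reject such inserts.
    if ECORI_SITE in cdna or HINDIII_SITE in cdna:
        raise ValueError("cDNA contains an internal EcoRI or HindIII site")
    forward = ECORI_SITE + cdna[:anneal_len]
    reverse = HINDIII_SITE + reverse_complement(cdna[-anneal_len:])
    return forward, reverse


# Hypothetical 18-nucleotide open reading frame, short annealing region for clarity.
fwd, rev = design_primers("ATGAAACCCGGGTTTTAG", anneal_len=6)
```

In practice a few extra bases are often added 5' of each site to improve digestion near the fragment end; that refinement is omitted from this sketch.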

The amphotropic pA317 or GP+am12 packaging cells are grown in tissue culture
to confluent density in Dulbecco's Modified Eagle's Medium (DMEM) with 10% calf
serum (CS), penicillin and streptomycin. The MSV vector containing the gene is then
added to the media and the packaging cells are transduced with the vector. The packaging
cells now produce infectious viral particles containing the gene (the packaging cells are
now referred to as producer cells).

Fresh media is added to the transduced producer cells, and subsequently, the
media is harvested from a 10 cm plate of confluent producer cells. The spent media,
containing the infectious viral particles, is filtered through a Millipore filter to remove
detached producer cells and this media is then used to infect fibroblast cells. Media is
removed from a sub-confluent plate of fibroblasts and quickly replaced with the media
from the producer cells. This media is removed and replaced with fresh media.

If the titer of virus is high, then virtually all fibroblasts will be infected and no
selection is required. If the titer is very low, then it is necessary to use a retroviral vector
that has a selectable marker, such as neo or his. Once the fibroblasts have been efficiently
infected, the fibroblasts are analyzed to determine whether protein is produced.

The engineered fibroblasts are then transplanted onto the host, either alone or 
after having been grown to confluence on cytodex 3 microcarrier beads. 

Example 12: Method of Treatment Using Gene Therapy - In Vivo

Another aspect of the present invention is using in vivo gene therapy methods to 
treat disorders, diseases and conditions. The gene therapy method relates to the 
introduction of naked nucleic acid (DNA, RNA, and antisense DNA or RNA) sequences 
into an animal to increase or decrease the expression of the polypeptide. 

The polynucleotide of the present invention may be operatively linked to a
promoter or any other genetic elements necessary for the expression of the polypeptide
by the target tissue. Such gene therapy and delivery techniques and methods are known
in the art, see, for example, WO 90/11092, WO 98/11779; U.S. Patent Nos. 5,693,622,
5,705,151, 5,580,859; Tabata H. et al. (1997) Cardiovasc. Res. 35 (3): 470-479, Chao J. et
al. (1997) Pharmacol. Res. 35 (6): 517-522, Wolff J. A. (1997) Neuromuscul. Disord. 7
(5): 314-318, Schwartz B. et al. (1996) Gene Ther. 3 (5): 405-411, Tsurumi Y. et al.
(1996) Circulation 94 (12): 3281-3290 (incorporated herein by reference).

The polynucleotide constructs may be delivered by any method that delivers
injectable materials to the cells of an animal, such as injection into the interstitial space
of tissues (heart, muscle, skin, lung, liver, intestine and the like). The polynucleotide
constructs can be delivered in a pharmaceutically acceptable liquid or aqueous carrier.

The term "naked" polynucleotide, DNA or RNA, refers to sequences that are free
from any delivery vehicle that acts to assist, promote, or facilitate entry into the cell,
including viral sequences, viral particles, liposome formulations, lipofectin or
precipitating agents and the like. However, the polynucleotides of the present invention
may also be delivered in liposome formulations (such as those taught in Felgner P. L. et
al. (1995) Ann. NY Acad. Sci. 772: 126-139 and Abdallah B. et al. (1995) Biol. Cell 85
(1): 1-7) which can be prepared by methods well known to those skilled in the art.

The polynucleotide vector constructs used in the gene therapy method are
preferably constructs that will not integrate into the host genome nor will they contain
sequences that allow for replication. Any strong promoter known to those skilled in the
art can be used for driving the expression of DNA. Unlike other gene therapy
techniques, one major advantage of introducing naked nucleic acid sequences into target
cells is the transitory nature of the polynucleotide synthesis in the cells. Studies have
shown that non-replicating DNA sequences can be introduced into cells to provide
production of the desired polypeptide for periods of up to six months.

The polynucleotide construct can be delivered to the interstitial space of tissues
within an animal, including those of muscle, skin, brain, lung, liver, spleen, bone marrow,
thymus, heart, lymph, blood, bone, cartilage, pancreas, kidney, gall bladder, stomach,
intestine, testis, ovary, uterus, rectum, nervous system, eye, gland, and connective tissue.
Interstitial space of the tissues comprises the intercellular fluid, mucopolysaccharide
matrix among the reticular fibers of organ tissues, elastic fibers in the walls of vessels or
chambers, collagen fibers of fibrous tissues, or that same matrix within connective tissue
ensheathing muscle cells or in the lacunae of bone. It is similarly the space occupied by
the plasma of the circulation and the lymph fluid of the lymphatic channels. Delivery to
the interstitial space of muscle tissue is preferred for the reasons discussed below. The
polynucleotide constructs may be conveniently delivered by injection into the tissues
comprising these cells. They are preferably delivered to and expressed in persistent,
non-dividing cells which are differentiated, although delivery and expression may be
achieved in non-differentiated or less completely differentiated cells, such as, for
example, stem cells of blood or skin fibroblasts. In vivo muscle cells are particularly
competent in their ability to take up and express polynucleotides.

For the naked polynucleotide injection, an effective dosage amount of DNA or
RNA will be in the range of from about 0.05 µg/kg body weight to about 50 mg/kg body
weight. Preferably the dosage will be from about 0.005 mg/kg to about 20 mg/kg and
more preferably from about 0.05 mg/kg to about 5 mg/kg. Of course, as the artisan of
ordinary skill will appreciate, this dosage will vary according to the tissue site of
injection. The appropriate and effective dosage of nucleic acid sequence can readily be
determined by those of ordinary skill in the art and may depend on the condition being
treated and the route of administration. The preferred route of administration is by the
parenteral route of injection into the interstitial space of tissues. However, other
parenteral routes may also be used, such as inhalation of an aerosol formulation,
particularly for delivery to lungs or bronchial tissues, throat or mucous membranes of the
nose. In addition, naked polynucleotide constructs can be delivered to arteries during
angioplasty by the catheter used in the procedure.

The dose response effects of injected polynucleotide in muscle in vivo are
determined as follows. Suitable template DNA for production of mRNA coding for
polypeptide of the present invention is prepared in accordance with a standard
recombinant DNA methodology. The template DNA, which may be either circular or
linear, is either used as naked DNA or complexed with liposomes. The quadriceps
muscles of mice are then injected with various amounts of the template DNA.

Five to six week old female and male Balb/C mice are anesthetized by
intraperitoneal injection with 0.3 ml of 2.5% Avertin. A 1.5 cm incision is made on the
anterior thigh, and the quadriceps muscle is directly visualized. The template DNA is
injected in 0.1 ml of carrier in a 1 cc syringe through a 27 gauge needle over one minute,
approximately 0.5 cm from the distal insertion site of the muscle into the knee and about
0.2 cm deep. A suture is placed over the injection site for future localization, and the skin
is closed with stainless steel clips.

After an appropriate incubation time (e.g., 7 days) muscle extracts are prepared
by excising the entire quadriceps. Every fifth 15 µm cross-section of the individual
quadriceps muscles is histochemically stained for protein expression. A time course for
protein expression may be done in a similar fashion except that quadriceps from different
mice are harvested at different times. Persistence of DNA in muscle following injection
may be determined by Southern blot analysis after preparing total cellular DNA and
HIRT supernatants from injected and control mice.
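
Staining every fifth 15 µm cross-section samples the muscle at 75 µm intervals. The Python sketch below lists the depths at which stained sections begin; the function name, the 1-based section numbering (sections 5, 10, 15, ... are stained) and the 300 µm example depth are assumptions made for illustration.

```python
def stained_section_depths_um(tissue_depth_um, section_um=15, every=5):
    """Return the starting depth (µm) of each stained cross-section.

    Sections are numbered from 1; every `every`-th section is stained.
    """
    n_sections = int(tissue_depth_um // section_um)
    return [(i - 1) * section_um for i in range(every, n_sections + 1, every)]


# Hypothetical 300 µm block: 20 sections, of which sections 5, 10, 15 and 20 are stained.
example_depths = stained_section_depths_um(300)
```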

The results of the above experimentation in mice can be used to extrapolate proper
dosages and other treatment parameters in humans and other animals using naked DNA.

Example 13: Transgenic Animals

The polypeptides of the invention can also be expressed in transgenic animals.
Animals of any species, including, but not limited to, mice, rats, rabbits, hamsters, guinea
pigs, pigs, micro-pigs, goats, sheep, cows and non-human primates, e.g., baboons,
monkeys, and chimpanzees may be used to generate transgenic animals. In a specific
embodiment, techniques described herein or otherwise known in the art, are used to
express polypeptides of the invention in humans, as part of a gene therapy protocol.

Any technique known in the art may be used to introduce the transgene (i.e.,
polynucleotides of the invention) into animals to produce the founder lines of transgenic
animals. Such techniques include, but are not limited to, pronuclear microinjection
(Paterson et al., Appl. Microbiol. Biotechnol. 40: 691-698 (1994); Carver et al.,
Biotechnology (NY) 11: 1263-1270 (1993); Wright et al., Biotechnology (NY) 9: 830-834
(1991); and Hoppe et al., U.S. Patent 4,873,191 (1989)); retrovirus mediated gene
transfer into germ lines (Van der Putten et al., Proc. Natl. Acad. Sci. USA 82: 6148-6152
(1985)), blastocysts or embryos; gene targeting in embryonic stem cells (Thompson et
al., Cell 56: 313-321 (1989)); electroporation of cells or embryos (Lo, Mol. Cell.
Biol. 3: 1803-1814 (1983)); introduction of the polynucleotides of the invention using a
gene gun (see, e.g., Ulmer et al., Science 259: 1745 (1993)); introducing nucleic acid
constructs into embryonic pluripotent stem cells and transferring the stem cells back
into the blastocyst; and sperm mediated gene transfer (Lavitrano et al., Cell 57: 717-723
(1989)); etc. For a review of such techniques, see Gordon, "Transgenic Animals," Intl.
Rev. Cytol. 115: 171-229 (1989), which is incorporated by reference herein in its
entirety.

Any technique known in the art may be used to produce transgenic clones
containing polynucleotides of the invention, for example, nuclear transfer into enucleated
oocytes of nuclei from cultured embryonic, fetal, or adult cells induced to quiescence
(Campbell et al., Nature 380: 64-66 (1996); Wilmut et al., Nature 385: 810-813 (1997)).

The present invention provides for transgenic animals that carry the transgene in
all their cells, as well as animals which carry the transgene in some, but not all their cells,
i.e., mosaic or chimeric animals. The transgene may be integrated as a single transgene
or as multiple copies such as in concatamers, e.g., head-to-head tandems or head-to-tail
tandems. The transgene may also be selectively introduced into and activated in a
particular cell type by following, for example, the teaching of Lasko et al. (Lasko et al.,
Proc. Natl. Acad. Sci. USA 89: 6232-6236 (1992)). The regulatory sequences required
for such a cell-type specific activation will depend upon the particular cell type of
interest, and will be apparent to those of skill in the art. When it is desired that the
polynucleotide transgene be integrated into the chromosomal site of the endogenous
gene, gene targeting is preferred. Briefly, when such a technique is to be utilized, vectors
containing some nucleotide sequences homologous to the endogenous gene are designed
for the purpose of integrating, via homologous recombination with chromosomal
sequences, into and disrupting the function of the nucleotide sequence of the endogenous
gene. The transgene may also be selectively introduced into a particular cell type, thus
inactivating the endogenous gene in only that cell type, by following, for example, the
teaching of Gu et al. (Gu et al., Science 265: 103-106 (1994)). The regulatory sequences
required for such a cell-type specific inactivation will depend upon the particular cell
type of interest, and will be apparent to those of skill in the art.

Once transgenic animals have been generated, the expression of the recombinant
gene may be assayed utilizing standard techniques. Initial screening may be
accomplished by Southern blot analysis or PCR techniques to analyze animal tissues to
verify that integration of the transgene has taken place. The level of mRNA expression
of the transgene in the tissues of the transgenic animals may also be assessed using
techniques which include, but are not limited to, Northern blot analysis of tissue samples
obtained from the animal, in situ hybridization analysis, and reverse transcriptase-PCR
(RT-PCR). Samples of transgenic gene-expressing tissue may also be evaluated
immunocytochemically or immunohistochemically using antibodies specific for the
transgene product.

Once the founder animals are produced, they may be bred, inbred, outbred, or
crossbred to produce colonies of the particular animal. Examples of such breeding
strategies include, but are not limited to: outbreeding of founder animals with more than
one integration site in order to establish separate lines; inbreeding of separate lines in
order to produce compound transgenics that express the transgene at higher levels
because of the effects of additive expression of each transgene; crossing of heterozygous
transgenic animals to produce animals homozygous for a given integration site in order to
both augment expression and eliminate the need for screening of animals by DNA
analysis; crossing of separate homozygous lines to produce compound heterozygous or
homozygous lines; and breeding to place the transgene on a distinct background that is
appropriate for an experimental model of interest.
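
For the strategy of crossing heterozygous transgenic animals, the expected proportion of homozygous offspring can be estimated with a simple Punnett-square enumeration. The sketch below assumes a single integration site that segregates in Mendelian fashion; the allele symbols ("T" for the transgene, "+" for the wild-type locus) and the function name are hypothetical.

```python
from collections import Counter
from itertools import product


def cross(parent1, parent2):
    """Return expected genotype frequencies for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "T+".
    """
    counts = Counter("".join(sorted(a + b)) for a, b in product(parent1, parent2))
    total = sum(counts.values())
    return {genotype: n / total for genotype, n in counts.items()}


# Heterozygote x heterozygote: one quarter of offspring are expected to be
# homozygous for the transgene ("TT"), half heterozygous ("+T").
offspring = cross("T+", "T+")
```

Founders with more than one integration site, or transgenes that affect viability, would not follow these simple ratios.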

Transgenic animals of the invention have uses which include, but are not limited
to, animal model systems useful in elaborating the biological function of polypeptides of
the present invention, studying conditions and/or disorders associated with aberrant
expression, and in screening for compounds effective in ameliorating such conditions
and/or disorders.

Example 14: Knock-Out Animals

Endogenous gene expression can also be reduced by inactivating or "knocking
out" the gene and/or its promoter using targeted homologous recombination. (E.g., see
Smithies et al., Nature 317: 230-234 (1985); Thomas & Capecchi, Cell 51: 503-512
(1987); Thompson et al., Cell 56: 313-321 (1989); each of which is incorporated by
reference herein in its entirety). For example, a mutant, non-functional polynucleotide of
the invention (or a completely unrelated DNA sequence) flanked by DNA homologous to
the endogenous polynucleotide sequence (either the coding regions or regulatory regions
of the gene) can be used, with or without a selectable marker and/or a negative selectable
marker, to transfect cells that express polypeptides of the invention in vivo. In another
embodiment, techniques known in the art are used to generate knockouts in cells that
contain, but do not express the gene of interest. Insertion of the DNA construct, via
targeted homologous recombination, results in inactivation of the targeted gene. Such
approaches are particularly suited in research and agricultural fields where modifications
to embryonic stem cells can be used to generate animal offspring with an inactive
targeted gene (e.g., see Thomas & Capecchi 1987 and Thompson 1989, supra).
However, this approach can be routinely adapted for use in humans provided the
recombinant DNA constructs are directly administered or targeted to the required site in
vivo using appropriate viral vectors that will be apparent to those of skill in the art.

In further embodiments of the invention, cells that are genetically engineered to
express the polypeptides of the invention, or alternatively, that are genetically engineered
not to express the polypeptides of the invention (e.g., knockouts) are administered to a
patient in vivo. Such cells may be obtained from the patient (i.e., animal, including
human) or an MHC compatible donor and can include, but are not limited to fibroblasts,
bone marrow cells, blood cells (e.g., lymphocytes), adipocytes, muscle cells, endothelial
cells etc. The cells are genetically engineered in vitro using recombinant DNA techniques
to introduce the coding sequence of polypeptides of the invention into the cells, or
alternatively, to disrupt the coding sequence and/or endogenous regulatory sequence
associated with the polypeptides of the invention, e.g., by transduction (using viral
vectors, and preferably vectors that integrate the transgene into the cell genome) or
transfection procedures, including, but not limited to, the use of plasmids, cosmids,
YACs, naked DNA, electroporation, liposomes, etc.

The coding sequence of the polypeptides of the invention can be placed under the
control of a strong constitutive or inducible promoter or promoter/enhancer to achieve
expression, and preferably secretion, of the polypeptides of the invention. The engineered
cells which express and preferably secrete the polypeptides of the invention can be
introduced into the patient systemically, e.g., in the circulation, or intraperitoneally.

Alternatively, the cells can be incorporated into a matrix and implanted in the
body, e.g., genetically engineered fibroblasts can be implanted as part of a skin graft;
genetically engineered endothelial cells can be implanted as part of a lymphatic or
vascular graft. (See, for example, Anderson et al., U.S. Patent 5,399,349; and Mulligan &
Wilson, U.S. Patent 5,460,959, each of which is incorporated by reference herein in its
entirety).

When the cells to be administered are non-autologous or non-MHC compatible
cells, they can be administered using well known techniques which prevent the
development of a host immune response against the introduced cells. For example, the
cells may be introduced in an encapsulated form which, while allowing for an exchange
of components with the immediate extracellular environment, does not allow the
introduced cells to be recognized by the host immune system.

Transgenic and "knock-out" animals of the invention have uses which include,
but are not limited to, animal model systems useful in elaborating the biological function
of polypeptides of the present invention, studying conditions and/or disorders associated
with aberrant expression, and in screening for compounds effective in ameliorating such
conditions and/or disorders.

All patents, patent publications, and other published references mentioned herein
are hereby incorporated by reference in their entireties as if each had been individually
and specifically incorporated by reference herein. While preferred illustrative
embodiments of the present invention are described, one skilled in the art will appreciate
that the present invention can be practiced by other than the described embodiments,
which are presented for purposes of illustration only and not by way of limitation. The
present invention is limited only by the claims that follow.
CLAIMS

We claim: 

1. An isolated nucleic acid molecule comprising

(a) a nucleic acid molecule comprising a nucleic acid sequence that encodes
an amino acid sequence of SEQ ID NO: 165 through 284;

(b) a nucleic acid molecule comprising a nucleic acid sequence of SEQ ID
NO: 1 through 164;

(c) a nucleic acid molecule that selectively hybridizes to the nucleic acid
molecule of (a) or (b); or

(d) a nucleic acid molecule having at least 60% sequence identity to the nucleic
acid molecule of (a) or (b).

2. The nucleic acid molecule according to claim 1, wherein the nucleic acid 
molecule is a cDNA. 

3. The nucleic acid molecule according to claim 1, wherein the nucleic acid 
molecule is genomic DNA. 

4. The nucleic acid molecule according to claim 1, wherein the nucleic acid
molecule is a mammalian nucleic acid molecule.

5. The nucleic acid molecule according to claim 4, wherein the nucleic acid 
molecule is a human nucleic acid molecule. 

6. A method for determining the presence of a lung specific nucleic acid
(LSNA) in a sample, comprising the steps of:

(a) contacting the sample with the nucleic acid molecule according to claim 1
under conditions in which the nucleic acid molecule will selectively hybridize to a lung
specific nucleic acid; and

(b) detecting hybridization of the nucleic acid molecule to a LSNA in the
sample, wherein the detection of the hybridization indicates the presence of a LSNA in
the sample.

7. A vector comprising the nucleic acid molecule of claim 1.

8. A host cell comprising the vector according to claim 7. 

9. A method for producing a polypeptide encoded by the nucleic acid molecule 
according to claim 1, comprising the steps of (a) providing a host cell comprising the 
nucleic acid molecule operably linked to one or more expression control sequences, and 
(b) incubating the host cell under conditions in which the polypeptide is produced. 

10. A polypeptide encoded by the nucleic acid molecule according to claim 1.

11. An isolated polypeptide selected from the group consisting of: 

(a) a polypeptide comprising an amino acid sequence with at least 60%
sequence identity to SEQ ID NO: 165 through 284; or

(b) a polypeptide comprising an amino acid sequence encoded by a nucleic 
acid molecule comprising a nucleic acid sequence of SEQ ID NO: 1 through 164. 

12. An antibody or fragment thereof that specifically binds to the polypeptide
according to claim 11.

13. A method for determining the presence of a lung specific protein in a sample, 
comprising the steps of: 

(a) contacting the sample with the antibody according to claim 12 under 
conditions in which the antibody will selectively bind to the lung specific protein; and 

(b) detecting binding of the antibody to a lung specific protein in the sample, 
wherein the detection of binding indicates the presence of a lung specific protein in the 
sample. 

14. A method for diagnosing and monitoring the presence and metastases of
lung cancer in a patient, comprising the steps of:

(a) determining an amount of the nucleic acid molecule of claim 1 or a
polypeptide of claim 6 in a sample of a patient; and

(b) comparing the amount of the determined nucleic acid molecule or the
polypeptide in the sample of the patient to the amount of the lung specific marker in a
normal control; wherein a difference in the amount of the nucleic acid molecule or the
polypeptide in the sample compared to the amount of the nucleic acid molecule or the
polypeptide in the normal control is associated with the presence of lung cancer.



15. A kit for detecting a risk of cancer or presence of cancer in a patient, said
kit comprising a means for determining the presence of the nucleic acid molecule of claim 1
or a polypeptide of claim 6 in a sample of a patient.

16. A method of treating a patient with lung cancer, comprising the step of
administering a composition according to claim 12 to a patient in need thereof, wherein
said administration induces an immune response against the lung cancer cell expressing
the nucleic acid molecule or polypeptide.

17. A vaccine comprising the polypeptide or the nucleic acid encoding the
polypeptide of claim 11.



SEQUENCE LISTING

<110> Macina, Roberto
Recipon, Herve
Chen, Sei-Yu
Sun, Yongming
Liu, Chenghua
diaDexus, Inc.

<120> Compositions and Methods Relating to Lung Specific Genes and 
Proteins 

<130> DEX-0292 

<150> 60/252,500 
<151> 2000-11-22 

<160> 284 

<170> PatentIn version 3.1

<210> 1
<211> 338
<212> DNA
<213> Homo sapien

<400> 1
tggatacgag cggacagccc gggcaggtac cgcatactag caaaggtaat ggtgatctag 60
caaacaaaat tggtttctag cagttagaag tgagcaggag cacttgtatt atagtattta 120
aataatcctg gttaatctct ttttaagccg agtaacccct ccagattttg cctttttatt 180
attgaggctg gctttatttt cttctacttt ttttcccgtt ttatagcagt taattatttt 240
tgtgattatt atgcaagaag cattgccctt gagttaaact gttattgttt cataagcagc 300
tattaaaata actgagcatg ttttatgaac atacacta 338


<210> 2
<211> 2446
<212> DNA
<213> Homo sapien

<400> 2
aaggatcctt aattaaatta atcccccccc cccttttttt ttgtattctt gccagtacag 60
tatatggttt ttctacccca attacatact gggttttgta ccacatcact aaaggcccaa 120
atcattgaag atacaaaacc gtacatgcag gctggttgtc tggttagtca atggctgatt 180
tgcttcaact gtctagtatg tatgtgcagc ctgaaactgg ctccttaaaa ggaaagccgg 240
gtcagtcatc ttgaaaaaat gacatgtaaa agtaaatcga taattgtttt gagagacggt 300
acatgtttta aaggttggcc ttaagcttca gtaacattgt cattttgtga ccttttgttg 360
tcacacctgt accctaacct gacaggaatt aactactgtt tttttgtggg gcagaaagca 420
aaacctggtg ttgtgacttt tatcctaatg gttcttaggc aaggttagtg agaagaaaca 480
caaacccaga tgcatgcatt gtgcattatt ttgtagacaa gctacttttt cttctgtccc 540
tttaacaaat ttgcagcaat taccctccct ttggggtcta gagtgaaagc taatttgtgg 600
gtagatgaga ttgcagaaga atggatgtcc atggctgtga acactgcaca ctgcacatcc 660
atctccagtg ctcacactgt gcagctacca ctccctggct gcgtgccatg ctgtcgggtt 720
gcagatttgc acacataaat tcctcaggaa gagtttgcat gagcatcacc tcgcaatatt 780
ctgtactgac caaacaaggg atttgaacgt ttttcagcac aaaaggataa cttccgagtg 840
gtggtctgta cgcatactag caaaggtaat ggtgatctag caaacaaaat tggtttctgc 900
agttagaagt gagcaggagc acttgtatta tagtatttaa ataatcctgg ttaatctctt 960
tttaagccga gtaacccctc cagattttgc ctttttatta ttgaggctgg ctttattttc 1020
ttctactttt tttcccgttt tatagcagtt aattattttt gtgattatta tgcaagaagc 1080
attgcccttg agttaaactg ttattgtttc ataagcagct attaaaataa ctgagcattg 1140
ttttatgaac atacactaat ctgagatact gaaaagcttt gcaactaaaa agcaaaacaa 1200
cctacattag tcatctagcc attgtttgga tgttttgagt tgatttttta tggtgcctct 1260
tttagcttgg aatattacgt ttactttaat ccaagtctag gccttttaaa gggtccttaa 1320
aattaaagtt cagaatgtga atccctttga catctattac aggtttatag gacctttttg 1380
gttgtgatta ctgttttcaa tacgattgta taaatgaagt taactttgtc agaagttaaa 1440
atggaggtca taggagttcc tggagaaatg gctctcctgt ttctttcatt accccactga 1500
agttcacccc agtttctggc cacaagaata tgagaaagga accctgttgt tttccaaggg 1560
aaatcattcc tctctgtccc cactgttgat taactaaagt cctggacacc ttccttcctc 1620
cactggccaa gacccacctt gacccacctt gaacctcttt tcagagccga gtggcatgaa 1680
tatgtgtact gtttctgctt ctgttgatgg agtggctgtg ggagaattaa aggaaatgct 1740
aatttgagct tcattcatag gggaacctac tatatattgc atccctgctg gttggaaatt 1800
atcttcatct ctggactgca ttgtttagaa aaatgttaat ggcttacaat tctgagaact 1860
ttattgtgtg gctctggggt taagaattct gtggtttgaa aaaaaataaa tattttgtat 1920
tgattctcac gtcatttcaa tgttgtgact atgtactaaa tgcactaaga ctgggtattc 1980
tcttagaaga gtgcgttttg ttaaacagat ggcagttcac tctcattagt tctatttgtc 2040
aatattacag ccacacactt caggataact tactcaaatg tgaagtcatg ggaagctatg 2100
gctagtaaca ggaatgcttg tgaaaaattt ggagggcaag cacgagcaga ggcttttgcc 2160
actcactagc acagccagag caaggatgaa gcatgaagtc ttgttcctag tggtgcttgt 2220
afccrtr'acraacf ccafcacftaacr ctcagcccgg gcgttctgca catcatctct ttaatcccag 2280
aaaaacaaat ggcggatgaa actcctattg acctgggata caagccctga ctttggctct 2340
tgatcttttg gtgggttctg aatcactcct tctgtattag ggcatatact ttcactctaa 2400
aacttgtggt aaatgaaata aatcttgtac agatgtaaaa aaaaaa 2446


<210> 3
<211> 460
<212> DNA
<213> Homo sapien

<400> 3
acacacccaa atgtattcca agaccatact ctgtgacagc tgaaagatca aattttttgc 60
ttgctatcca gagatgagcc ttctgctcca ctgggaactt ggctctacca gctcacagac 120
ccctttggga agctgcagaa ataaaacact cccctctatg cctcactaca caagattaat 180
acacttctag tgaaagaaga attttaatgc acttgagatg aaagaagcca gatgctttcc 240
tatttatgtc ctttccaacc atttagagtg cgggggatgg aagtgtagca aggcttgacc 300
tgcaactccc aggaccctgc tggtcatgtc aatcccaagg gcagaaatat ctcttctaga 360
atccttccag ctcacctcca ctgtggctac aagtgaaagc cataagtcaa acgggagctg 420
caggaagcca cacttactgc actgtccaag aataaatcaa 460


<210> 4
<211> 594
<212> DNA
<213> Homo sapien

<400> 4
acacacccaa atgtattcca agaccatact ctgtgacagc tgaaagatca aattttttgc 60
ttgctatcca gagatgagcc ttctgctcca ctgggaactt ggctctacca gctcacagac 120
ccctttggga agctgcagaa ataaaacact cccctctatg cctcactaca caagattaat 180
acacttctag tgaaagaaga attttaatgc acttgagatg aaagaagcca gatgctttcc 240
tatttatgtc ctttccaacc atttagagtg cgggggatgg aagtgtagca aggcttgacc 300
tgcaactccc aggaccctgc tggtcatgtc aatcccaagg gcagaaatat ctcttctaga 360
atccttccag ctcacctcca ctgtggctac aagtgaaagc cataagtcaa acgggagctg 420
caggaagcca cacttactgc actgtccaag aataaatcaa aagcttttaa tagcaagaag 480
ttaaaaataa agcagcacgt ccttaaaaat aaaatttcct acaactgaca aaataaagga 540
cgaacacgca ctggttgttt cttaaccact agtggacaag caaagggaaa aagc 594
<210> 5
<211> 283
<212> DNA
<213> Homo sapien

<400> 5
actactaaag gcaaccggca tggactcata atatttgtgg ggacagcaaa aagactaaaa 60
gttctaagga agaaaatgcg aaccttgata gtttgaaata gttaaaaaga cagtgtagaa 120
actgcttagg cagttggatt atggactatt agatgatact tgggtctgat aatggtataa 180
ggagaataaa gtatttaggg atccaatatt acgcctgcag ctttttccaa atagttcatg 240
ggggaggggg atgtgtaagt ggttaactga agtctaacta gat 283


<210> 6
<211> 2237
<212> DNA
<213> Homo sapien

<400> 6
tttttttttt caagaaaaaa aaatcacttt aattgaggaa cactttcagt ttgtgacaaa 60
attatgctgt gaatcaggtg ttgcaaatta tggcccactg cctgcttttg tgtaagtttt 120
attggaacac agctacattc agtccatggc tgcttttaga atacaacagt agactttaac 180
atttggaaca gggaacagaa accagagcca tacagctaat aaacttgaaa atatttacaa 240
gttgatgctt tacaaaatcc atctgctgac ccctgctctg taccattgtt ctcttctgat 300
ggtctgttta ctaaaaaata aaaacttcac aaacatgtaa aaaatagatt tgccatttaa 360
aatgtgcttt tcaagtttga ctttttagga tgcaattaat tcactaaata cagaacttaa 420
ctaaggacaa aatttaaaga tcagcattct ttcccttccc atcacgctca acttaacatg 480
aagaactgta aacatcctaa gcttacaaca aacctatcta gttagacttc agttaaccac 540
ttacacatcc ccctccccca tgaactattt ggaaaaagct gcaggcgtaa tattggatcc 600
ctaaatactt tattctcctt ataccattat cagacccaag tatcatctaa tagtccataa 660
tcaaactgcc taagcagttt ctacactgtc tttttaacta tttcaaacta tcaaggttcg 720
cattttcttc cttagaactt ttagtctttt tcttccccaa aatatttgag tccatgccag 780
ttgcctttag ttgtacccaa ataatggttt gtctatttcc taaaagtagt actcttaaat 840
ttaaatttag tgttattttt gttgtcattg ttccttcttc ctcatgtggt tgtgcaggca 900
gagcttgagc atccagattt caaaattaaa aaataaaaga taatctagtt taatatatag 960
tagttgaatc accttaagtc tagactgctg tatgagcacc cattatcttt cactatattc 1020
catcatcccc caacatatcc acagtagatg aagggcagtt tgctcaaaca ttgttttgat 1080
cctgtcatgt ctgttcagaa atgcctgtct attcagaaac ccacgtctaa taacaaaatc 1140
ttggactggt tactatcaaa acccaacaac atacagactc ctcagctagg ccctagggat 1200
atttttctac cttgatttcc aaatgttcat tgaaagaatg cttaattcta atttggaaaa 1260
aagtttttgg cttcccactt ctgctttaca cgttcatctt tcttgaaatc aaatccaatc 1320
caatctatat tctaagaacc tgctcaaatc ttggttcttc aaagctttcc ctggtatttt 1380
gcatttttgc tttgaatagt tccacgaagg ataacttctt actccttcct tcatctttct 1440
gtatcttgca tatagtaaat attaatgact tgtttgcatt ttgttatcct aacttggcta 1500
taaagaaaat cagatgtctt caccagtcgt tcaaacttca ggtctgccta cagattcata 1560
gatggctgtg gatttttata attttgtcac aaagttagtg gtaactacag gttatctcag 1620
aatatctttt ttggcgtata aatttttttc ttttcctttt ttagacagtc tctctctctg 1680
tcgcccaggc tagagtgcag tggcgtgatc ccggctcact acaacctctg cctcctgggt 1740
tcaagagatt cttagcctca gcctcccgag tagctagggt tacaggcgcg caccacctcc 1800
atgcccagct cttttgtatt tttaagtaga gacagggttt caccatgttg gtcaggctgg 1860
tctcgaactt ctgacttcag gcaaccttag ccgcctcggc ctcccaaagt gctgggatta 1920
caggcacaag ccactgcacc cagccttatt accataaatc atcttgatgc tggtacctga 1980
taagattcta tttgcttttc tttattcata gagaccacaa acagatcgca gatccaggtt 2040
tctcaaactg gagcatctgc ttaattttcc cataaaatca gtcttattct ttctgacagc 2100
tctgagactc ctccggccac gactaggtgc tgtcctggag gaaacggtgg aggacggccg 2160
cacaaaaacc aatctacctg atgaaaactc cgttcccttc tcgccagaaa cataaaatgc 2220
gatggatacg ctcgtgc 2237


<210> 7
<211> 777
<212> DNA
<213> Homo sapien

<400> 7
tggggtcgcg gcgaggtacc tttctgtttt aaaatgttca aataatgtat tttttaaaaa 60
tgtgtaatca ttcaaggaaa cgggaaaaca agtctaaata aaaccgggtc ctaaagttta 120
ctctaggagt cattattctt cctttgcagt ctcaattcta tttactccgt aacaagtaaa 180
ctgttttatc ccttcggctg ggaaggataa tttttaccct accaagatga tgacaatgcc 240
actgcagtgc tcacccagca agaaacacat aaatatgaag cagcgtcaaa gtctttctat 300
gaagtgtatg caagaattga tttgcaatgc attaattcag gaattcagtg gagaggctcc 360
tactatacct agcagatcta aacttgagct catgataaca ctgcacaaag ctatggtaga 420
taaaagtgat aggtaatgcc aatagaacaa ttccactgac aacacaaact cctccaagaa 480
ttcttccagg cactgtgata ggatacatat ctccatagcc aactgtagtc atagagataa 540
tcacccacca gcaggcagca ggaatgctgg taaagtcctt gttggatgtt tccaggtcca 600
gcccatgttc aagaagctga gaaagtgcac taaagattgc catggcaaca caaatgaaga 660
cagtaacata accatctctc ggtagcacgt ttgagagtca accgagtgct gaagaccatt 720
aagtgacggc agctggcgta atcatgtcta gctgttctgt gtgaatgttc gtcaacc 777

<210> 8
<211> 911
<212> DNA
<213> Homo sapien

<400> 8
[illegible] cattcacaca gaacagctag acatgattac gccagctgcc gtcacttaat 60
ggtcttcagc actcggttga ctctcaaacg tgctaccgag agatggttat gttactgtct 120
tcatttgtgt tgccatggca atctttagtg cactttctca gcttcttgaa catgggctgg 180
acctggaaac atccaacaag gactttacca gcattcctgc tgcctgctgt gggtgattat 240
ctctatgact acagttggct atggagatat gtatcctatc acagtgcctg gaagaattct 300
tggaggagtt tgtgttgtca gtggaattgt tctattggca ttacctatca cttttatcta 360
ccatagcttt gtgcagtgtt atcatgagct caagtttaga tctgctaggt atagtaggag 420
cctctccact gaattcctga attaatgcat tgcaaatcaa ttcttgcata cacttcatag 480
aaagactttg atgctgcttc atatttatgt gtttcttgct gggtgagcac tgcagtggca 540
ttgtcatcat cttggtaggg taaaaattat ccttcccagc cgaagggata aaacagttta 600
cttgttatgg agtaaataga attgagactg caaaggaaga ataatgactc ctagagtaaa 660
ctttaggacc cggttttatt tagacttgtt ttcccgtttc cttgaatgat tacacatttt 720
taaaaaatac attatttgaa cattttaaaa cagaaaggta ctattttcca atgtttttcc 780
atcttatgaa ttcagaagaa gcttggaact tatagtgttt tttgtttgag agtaacattt 840
tcatttctaa atgttttata atttctcata tcaatgtcag aagtatcctg gaaacatatg 900
tcacatgcga g 911



<210> 9
<211> 445
<212> DNA
<213> Homo sapien

<400> 9
gccgcccggg caggtacatg tgcacttaaa tgtaatagca ccaacattca ttatattatg 60
aaggggatac tttataagaa ttataaatta tttttacatg attaaataat tttggcagag 120
taagtccgca ggacttaaat aaccagtcag ccttagtatc tacatctgga ccaggaagcc 180
ttgtgttaca gacacagatt ccctgtgaag ttctccaggg tgtaaagaag gccccaggga 240
ggtgccgacg ggatgtgagg gctgcctttc agctcaggcg agcctgcaag caacagggca 300
gttgggaagg gtcacaggca caggaaggga gcagtgggca accattgatt acatcaaacc 360
aggatggttt cttcttttta aagaaacttt tgtgagtgtt tacccacccc catccacaca 420
aatatgctct ctaggaaatg tgaaa 445


<210> 10
<211> 1254
<212> DNA
<213> Homo sapien

<400> 10
gtccgcttaa ttaaagatct tttttttttt ttttttttta ttgttaagca tatttgtata 60
ttttttacta gttatttcat acttgccctg aaagaataca cattcaaaaa gcttgaaatt 120
aggcaatgtc agtctcatca aacaaaacca gcattggaag cgaatattaa caaatatcag 180
aatgaaatta caaaatatac atctccagcc tcataaaaca tgaattttat aagcactttc 240
ataaacataa gaaaaatagc tttgacaata actatttgaa aacattatta ttaataatct 300
tgtatactgt acatgtgcac ttaaatgtaa tagcaccaac attcattata ttatgaaggg 360
gatactttat aagaattata aattattttt acatgattaa ataattttgg cagagtaagt 420
ccgcaggact taaataacca gtcagcctta gtatctacat ctggaccagg aagcctttgt 480
gttacagaca cagattccct gtgaagttct ccagggtgta aagaaggccc cagggaggtg 540
ccgacgggat gtgagggctg cctttcagct caggcgagcc tgcaagcaac agggcagttt 600
gggaagggtc acaggcacag gaagggagca gtgggcaacc attgattaca tcaaaccagg 660
atggtttctt ctttttaaag aaacttttgt tgagtgttta cccaccccca tccacacaaa 720
tatgctctct aggaaatgtg aaagtataag cttcagaaaa atgtttttct catcctttaa 780
tttctggttt tatgatcaaa tgttccgaag acactgcttc tttttgctca aaatagttga 840
cattgtcatt gcttctagtt tttcagctcc tactatgctt actatgtcca tgggtgccat 900
tgacgtcttt ggctatttat catctctaga aaagaaaaaa aaaacattaa aattttagga 960
tgtaggattc acaataaatc tcttaccaaa tcctccttaa atccctaaac tctctacctg 1020
gctctgactc caggatcacc ctttcagttc tttccatgtt tttcttcgga aaatgtagtt 1080
tcctaacctt tctacccacc cttctcgtct actcttaggt ctgcagccac agtgatagtt 1140
ctaagttgca aacctgatca tgtccttcca gtacttagtc gttgaaaaat cttcctactt 1200
cccgcagatt gaagcctgaa ctctttaata ggcatctaag gacttggacc cgga 1254


<210> 11
<211> 838
<212> DNA
<213> Homo sapien

<400> 11
gcccgggcag gtacttaact aattgggctg aggatgaata tatcagccac agcacattaa 60
agaatgagcc aaggattgtc atggttggtc actttttaaa gtattgatta ctgcaactgg 120
aaaatooaaa gtgtatattg gtgacgccaa cctcagtttc tgagcactcc tgctctgtgg 180
tgagaatcag acaaaaattc atcggggtga aaaaggcatt acctgattca cacccttgtc 240
ttgctagccc tcttccattc atttctcaca cagcacttgc tctgttaaat cctctctctg 300
tctcagacca tgcttgcccc ttcaaagggt atggttcagg ctcctttcaa gacattggag 360
tttctctctg gggaaagaga gccccctact ggtttggctt cagtctaggt ccaccatccc 420
tctcgatctg gcatcttgga gattaattta aaaggcaagc tcaccacaat gtaagcctat 480
ggtctggcca accttgcttt gggaactgtg acaccaaagc ccccaggact atctgcctct 540
ccaggagcca gatagaatga catgcctttt tcctaattgt ccacattcca cccccaagcc 600
actgccactg tgggccaagc catccatctt gcaatcttca tctaaaacag ctctcatttc 660
atgccagttt tgtcaaacct gcaccgtcac aagatattca gaagatgaaa acgttagaag 720
acacccctga attaaaagca cttactagca gggggtggat tatccaaaag tgccgtgatc 780
ggttgtacct gggtttctca ccaatggaat cagctcctgg tgaacaagca tgtgggtg 838


<210> 12
<211> 1033
<212> DNA
<213> Homo sapien

<400> 12
atttaaggct gtacttaact aatttgggct gaggatgaat atatcagcca cagcacatta 60
aagaatgago caaggatttg tcatggttgg tcacttttta aagtatttga ttactgcaac 120
tggagaatga aaagtgtata ttggtgacgc caacctcagt ttctgagcac tcctgctctg 180
tggtgagaat cagacaaaaa ttcatcgggg tgaaaaaaaa aaggcattac ctgattcaca 240
cccttgtctt gctagccctc ttccattcat ttctcacaca gcactttgct [illegible] 300
ctctctctgt ctcagaccat tgcttgcccc ttcaaagggt atggttcagg ctcctttcaa 360
gacatttgga gtttctctct ggggaaagag agccccctac tggtttggct tcagtctagg 420
tccaccatcc ctctcgatct ggcatcttgg agattaattt aaaaggcaag ctcaccacaa 480
tgtaagccta tggtctggcc aaccttgctt ttgggaactg tgacaccaaa gcccccagga 540
ctatctgcct ctccaggagc cagatagaat gacatgcctt tttcctaatt gtccacattc 600
cacccccaac ccactgccac tgtgggccaa gccatccatc ttgcaatctt catctaaaac 660
agctctcatt tcatgccagt tttgctcaaa cctgcaccgt cacaagatat tcagaagatg 720
aaaacgtaga agacacccct gaattaaaaa cacttacata gcagtggctg gaattactcc 780
aaaacgtgcc cagtgatcgc actgtaacat gggattttct cacccaaata ggcaactcat 840
gcttcctgag tgtaatcaaa gcatgtggtg ttttggggcc atatgcacca ggtttctatt 900
ttagaaacct tcagctgtct tgcttatgta ccgtatgtaa atttattctt tttaaaaatc 960
acttttattt gattttgact tattaaatgc tttaaaagcc aaaaaaaaaa aaaaaaaaaa 1020
aaaaattggt cgg 1033

<210> 13
<211> 824
<212> DNA
<213> Homo sapien

<400> 13
acatatgaaa gtgacctcca aggggattgg tgaatagtca taaggatctt caggctgaac 60
agactatgtc tggggaaaga acggattatg ccccattaaa taacaagttg tgttcaagag 120
tcagagcagt gagctcagag gcccttctca ctgagacagc aacatttaaa ccaaaccaga 180
ggaagtattt gtggaactca ctgcctcagt ttgggtaaag gatgagcaga caagtcaact 240
aaagaaaaaa gaaaagcaag qaqqaciqqtt gagcaatcta gagcatggag ttgttaagtg 300
ctctctggat ttgagttgaa gagcatccat ttgagttgaa ggccgcaggg cacaatgagc 360
tctcccttct accaccagaa agtccctggt caggtctcag gtagtgcggt gtggctcagc 420
tgggttttta attagcgcat tctctatcca acatttaatt gtttgaaagc ctccatatag 480
ttagattgtg ctttgtaatt ttgttgttgt tgctctatct tattgtatat gcattgagta 540
ttaacctgaa tgttttgtta cttaaatatt aaaaaacact gttatcctac aaagaaaaaa 600
aaacaaaaca aaaaaaaaca agggtggggg taatccgtcg ggcataagct gtcccctgtg 660
tgactggggt tctccgtccc catccccatc tccgtgacac acaaaaaagg ccagacggcg 720
cacatgcact caccgaccag tacaaccaca caaggaaacg acgtaggcac accaacaacc 780
gacagcagaa agcaatacag aaaacaacaa caaccacacc tcac 824

<210> 14
<211> 1093
<212> DNA
<213> Homo sapien

<400> 14
acctagagtg gttggaccat cagatgtttg ggcaaaactg aaagctcttt gcaaccacac 60
accttccctg agcttacatc actgcccttt tgagcagaaa gtctaaattc cttccaagac 120
agtagaattc catcccagta ccaaagccag ataggccccc taggaaactg aggtaagagc 180
agtctctaaa aactacccac agcagcattg gtgcagggga acttggccat taggttatta 240
tttgagagga aagtcctcac atcaatagta catatgaaag tgacctccaa ggggattggt 300
gaatactcat aaggatcttc aggctgaaca gactatgtct ggggaaagaa cggattatgc 360
cccattaaat aacaagttgt gttcaagagt cagagcagtg agctcagagg cccttctcac 420
tgagacagca acatttaaac caaaccagag gaagtatttg tggaactcac tgcctcagtt 480
tgggtaaagg atgagcagac aagtcaacta aagaaaaaag aaaagcaagg aggagggttg 540
agcaatctag agcatggagt ttgttaagtg ctctctggat ttgagttgaa gagcatccat 600
ttgagttgaa ggccacaggg cacaatgagc tctcccttct accaccagaa agtccctggt 660
caggtctcag gtagtgcggt gtggctcagc tgggttttta attagcgcat tctctatcca 720
acatttaatt gtttgaaagc ctccatatag ttagattgtg ctttgtaatt ttgttgttgt 780
tgctctatct tattgtatat gcattgagta ttaacctgaa tgttttgtta cttaaatatt 840
aaaaacactg ttatcctaca aaaaaaaaaa aaaaaaaaaa aaaaaaacaa gggtgggggt 900
aatccgtcgg gcataagctg tcccctgtgt gactggggtt ctccgtcccc atccccatct 960
ccgtgacaca caaaaaaggc cagacggcgc acatgcactc accgaccagt acaaccacac 1020
aaggaaacga cgtaggcaca ccaacaaccg acagcagaaa gcaatacaga aaacaacaac 1080
aaccacacct cac 1093

<210> 15
<211> 428
<212> DNA
<213> Homo sapien

<400> 15
atgtagcgat ggcatggtca ctaatctgct cacggcgcag tgtggatgga tggtcgcggc 60
gaggtgtttt tgctgtctta atttactaaa ttgcctaatt tttagttgat ttctctacag 120
agcttacctt gtgtagctat ttctctttta ctgttgcttt ggtgtgttct tgaagtttga 180
ttgattattt ttggtcattg agttcttagt gtagtttgct agatgcatta t taagatxcc 240
tttgtgtctg acacttgtgg tgagttgctt ttgaaatctg ttgagatttc catgtgaaga 300
actgataatc aggctcttga tctgcttccc taaattactt ttttagagcc caaaccccag 360
gtttagggtg agggtttgta atacaaagaa acacttccat cacttctgcc cttaaccctt 420
gcctctat 428


<210> 16
<211> 6823
<212> DNA
<213> Homo sapien

<400> 16
aaacttggag agacgcagga caggatcccg gcggcagaag gacggagaga aaggggaccc 60
cgggacggga aaggcgcaga gcaggcgcgg gcggcggcgg cggcggggca gggcagggcg 120
ggcgtcccgg cagagggcgc gcggtcgccc tgtcgccctc cgccccgccg gggtcacagt 180
gccccctccc tcgcgcccta gccgccctgc cgggctattt ttacgcgcgg acaccggaca 240
ccggacaccg ggctggggcg gcggcggcgg cggccgaggc ggccgaggcg gggccgcacc 300
ggggccgggc gtcggggcca cacgtcggtt cgcgggtcgc cggggctgcg cgcgccatgg 360
agccgcggtg cccgccgccg tgcggctgct gcgagcggct ggtgctcaac gtggccgggc 420
tgcgcttcga gacgcgggcg cgcacgctgg gccgcttccc ggacactctg ctaggggacc 480
cagcgcgccg cggccgcttc tacgacgacg cgcgccgcga gtatttcttc gaccggcacc 540
ggcccagctt cgacgccgtg ctctactact accagtccgg tgggcggctg cggcggccgg 600
cgcacgtgcc gctcgacgtc ttcctggaag aggtggcctt ctacgggctg ggcgcggcgg 660
ccctggcacg cctgcgcgag gacgagggct gcccggtgcc gcccgagcgc cccctgcccc 720
gccgcgcctt cgcccgccag ctgtggctgc ttttcgagtt tcccgagagc tctcaggccg 780
cgcgcgtgct cgccgtagtc tccgtgctgg tcatcctcgt ctccatcgtc gtcttctgcc 840
tcgagacgct gcctgacttc cgcgacgacc gcgacggcac ggggcttgct gctgcagccg 900
cagccggccc gttccccgct ccgctgaatg gctccagcca aatgcctgga aatccacccc 960
gcctgccctt caatgacccg ttcttcgtgg tggagacgct gtgtatttgt tggttctcct 1020
ttgagctgct ggtacgcctc ctggtctgtc caagcaaggc tatcttcttc aagaacgtga 1080
tgaacctcat cgattttgtg gctatccttc cctactttgt ggcactgggc accgagctgg 1140
cccggcagcg aggggtgggc cagcaggcca tgtcactggc catcctgaga gtcatccgat 1200
tggtgcgtgt cttccgcatc ttcaagctgt cccggcactc aaagggcctg caaatcttgg 1260
gccagacgct tcgggcctcc atgcgtgagc tgggcctcct catctttttc ctcttcatcg 1320
gtgtggtcct cttttccagc gccgtctact ttgccgaagt tgaccgggtg gactcccatt 1380
tcactagcat ccctgagtcc ttctggtggg cggtagtcac catgactaca gttggctatg 1440
gagacatggc acccgtcact gtgggtggca agatagtggg ctctctgtgt gccattgcgg 1500
gcgtgctgac tatttccctg ccagtgcccg tcattgtctc caatttcagc tacttttatc 1560
accgggagac agagggcgaa gaggctggga tgttcagcca tgtggacatg cagccttgtg 1620
gcccactgga gggcaaggcc aatggggggc tggtggacgg ggaggtacct gagctaccac 1680
ctccactctg ggcaccccca gggaaacacc tggtcaccga agtgtgagga acagttgagg 1740
tctgcaggac ctcacacctc cctagaggga gggagggagg gcagggtgga gggcaaggct 1800
ggggggaggg gattgggttt aggaagagct aggttaagtc gtaacgagtg gggaaacact 1860
gagtcttgtt gggtcttggg ttgtgtggtt tggtagctcc tgtgggtacc tcctgaagca 1920
gcagcgaatg gcaatgggtt gtgttgtgtt aatgaagact caattggttc atattactct 1980
gagttgtgca aagctcatgg agccttttgg ggtagtgttg agataggttt ggtcgtatca 2040
ttttgtgagt ttcctaggtc agtgttgggt ttggttgggt tgtgagtctg ggatagtgtg 2100
gtccagctgc attgtgtagg attctgtggt ttggtgggtc ccctagggcc atgttgggtc 2160
aagttagatg gtcccccatg gcattgttga gatcgaatgt gtgtggtgtt aagtttcgtt 2220
gagacatggt ggaaattgtg tagctctgtg attcttccag gggcatgtta ttttaggttc 2280
tgtgaacttg cgagtcatgt agaaatgtga agagtccagt ggtagaattt gagctttcta 2340
ggtcacattg ggttaagttt gtatgaccaa atgaatcttg tagggttctg ttgggcttaa 2400
ctgtgtagag gtgtgtggct ggacattttt cgtggccaca gcgagttgag ttgtgttgaa 2460
ttgtacaacc atatgagcct tgtaaggcca gttcagttgg gtcatgccac tgtttgagtc 2520
tcatagggcc atgctgaatt gagttccatt gagttgtgtc actatgtgag tcctacagga 2580
agttgggttg agttggactg tgcgaacgag ttccataggg ccacatcggg ctgttttgca 2640
tttagtggta gcaccaggac ccaaaggaaa tagcagtggg gaagcatcat gtatctggga 2700
gcatgcagtg gcgagggctc tgggaggtgt gccgagctgg ctccccagct cgctgtaggg 2760
ggcgggactg gattctgtat ccatgggatt gggtgttcat ccagaggcga ctgggtaaat 2820
taggaagagg tggatgctcc tcctgtttac cccacatcca cttcattgtg ctgttcactc 2880
ccattctccc ctacagtttt atgctcagac atggaggtca gagccacaag ggaaagggga 2940
aaactgtact


ctgtccagac


atgatagagg


gacagagcca


aaaggataga 


3000 


gaaagagacc 


cagaaaaagg 


aagaggtgga


aacccagaga 


gacagagacc 


caaagggaga 


3060 


gaaacagaga 


ctcagggaga


gagagacaat


gacctggagg


gtggggtatg


gcagagacgc


3120 


aaaaaaaaaa 


aacagaaatc 


cacraQwQQQQ 


agacagagac 


caagagcagg


ggatagaagc


3180 


caaacaaaat


aacccatgcc


tgtaatctca


gcactctggg


agaccgagga


agggggattg


3240 


attgaggcca 


ggagttcaag 


accagcctgg 


gcaacatggt 


gagaccccat 


ctctacaaaa 


3300 


aatacaaaaa 


ttagctgagt 


gtggtggcac


atgcctgtga 


tcccagctac 


tcaggaagct 


3360 


gaggcagaaa 


gatcccttga 


ccctgagagg 


tagaggctgc 


attgagccat


gattgcacca


3420


ctgcactcca


gcctgggcaa


cagagggagc


ccccgtctca


acaaacaaac


aaaaagagcc


3480 


a at* crercrcrcr a a 




aaacccaaaa 


ggcagcgtca


gacacccaga


gttggagaca


3540 


aaacaacaaa 


gtctcagaaa


aagagaacca


caatagaaaa


aggcagaaaa


ggccgggcgc


3600 


aatoactcac 


gcctgtaatc


ccagcacttt


gggaggccga


ggtgggcaaa


ttacgaggtc 


3660 


aaaaaatcca 


gaccatcctg 


gctaacacgg


tgaaaccctg 


tgcctactaa 


aaattcaaaa 


3720 


aattagccgg


gtgtactggc


atgtgcctgt


aatcctagct


acttgggagg


ctgaggcagg


3780 


agaatcactt


gaacctggga




gcagtgagcc


aagatcacgc


cactgcactc 


3840 


cagcctgggc 


gacagagcga


gactctgtct 


caaaaataaa 


tgactagata 


aaaaaaaaaa 


3900 




acattttccc


tgcattttag 


agattaggac


agtggtcact


ttctagttgc 


3960 


cgtcttcaca 


gacaaacttg


aaaggtatta 


aggaaaaaat 


tccacatagc 


tctgggcctt


4020 


ttttgtattc 


tgattgccac


gttaatctgt 


aatcgttaag 


tacccatgaa 


agaatgtgtt 


4080 


tcagtatcag 


tgcttttttg 


gtactttaac 


aagtattgta 


ggcacggtgg


atttttttga 


4140 


aatacccatg 


tgtccaatag 


caagttaagg


aacctagcgc 


atcaaatttt 


gttttcttca 


4200 


ttcatttgga 


accttatttt 


cgcaaaaaac 


ctagctggaa 


taataatgcc


atattgtatt 


4260 


ttgcacactg 


ctttattttc 


t a aa acr c t* r 1 1" 


aaaaacaaat 


tacattcatc


acattacctc 


4320 


tactctctca 


a rra ¥• t~ a a a crt" 


V*tl» WWCl^ WGICI 


cattctttat 


tgtcatcata 


aggaaaaact 


4380 


tttatcgcaa


taataotcacr 


lv k> W C* w CL I* l» C*y 


tttagcacta 


tagatagtat 


tcagaaattg 


4440 


ctttctcttt 


ggtaaagaag 


ataaataggt 


gaattttatt 


gcctgtgttt 


tcaggcttca 


4500 


gaggtacctg 


gcaccttact 


caagcaaaaa 


caaaaaacag 


ctttacttca 


aaagcatgtg 


4560 


cctctggtaa 


attatccatc 


acttttgttt 


taaagtggtc 


cctccaagaa 


taatgtggca 


4620 


tacacagtgg 


gtttggttgt 


aaggcagatg 


taatagatga 


cttcaaaagc 


ctgctcctta 


4680 


ctgcactctg 


gacaaggccc 


tggtctgtat 


agtttggctt 


cctctgttaa 


gtcattctac 


4740 
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aaggtgactt 


gccatttgaa 


ctttcgtaag 


agacctcagc 


agttagactg 


agaaaagtta 


4800 


gagtgctgta 


gttatgaagt 


catacatttt 


tgataaaata 


cctgtggaaa 


accagggatc 


4860 


ctctcctgtt 


tctctctctt 


gcctccccct 


ttccctccat 


tcctctgtcg 


cctccccctc 


4920 


tcctccactt 


tctttaagaa 


tcagggggta 


cagaataacc 


tacaaacaac 


ctcttcaaag 


4980 


tacccttgga 


aattgaagtt 


caatcttaag 


ttttttgctg 


tctttaattt 


actagaattg 


5040 


cctaattttt 


agttgatttc 


tctacagagc 


ttagcttgtg 


tagttatttc 


tcttttactg 


5100 


ttgctttggt 


gtgtcttgaa 


gtttgattga 


ttatttttgg 


tcattgagtt 


cttagtgtag 


5160 


tttgctagat 


gattattaag 


attcttttgt 


gtctgacact 


tgtggtgagt 


tgcttttgaa 


5220 


atctgttgag 


atttccatgt 


gagaactgat 


aatcaggctc 


ttgatctgct 


tccctaaatt 


5280 


acttttttag 


agcccaaacc 


ccaggtttag 


ggtgagggtt 


tgtaatacaa 


agaaacactt 


5340 


ccatcacttc 


tgcccttaac 


ccttgcctct 


attacctaag 


cttagtaagt 


agttttctta 


5400 


atattgtcca 


atgtttggtg 


ttatctgcat 


tgttttttgt 


ttttttaatt 


gtggttgttc 


5460 


tgacgtgtga 


aatttaagga 


aagagccatt 


ttttaaaaat 


gccgagcagg 


gcaatgcaag 


5520 


tacagccaaa 


tattagactt 


gatgtgcaat 


cttcggtcct 


ttttaatctg 


gggtattata 


5580 


ggcagtactt 


taaattgcaa 


agtcttccgg 


gcctattttc 


ctctacattt 


ttgtaattaa 


5640 


ctctgggggc 


ttacttgttt 


tggcagtact 


gaaatcaaag 


gagctggttc 


ttcttttctc 


5700 


ccaattattt 


tcatatgaaa 


gcacctacaa 


ttagcctgtt 


agtcctattc 


agatacatca 


5760 


aatatcagtg 


aatgctttac 


tattcgcaca 


tttaagcatc 


tttgttttac 


ataaaattag 


5820 


agtatgaaaa 


ccagtgttca 


attttttatc 


ttgttgagct 


tgtaaaatgc 


cagcaattta 


5880 


aaactaggac 


ttttcccccc 


ataagccaag 


gaggtagaat 


tactaataca 


agggttaaag 


5940 


aaggtagatt 


ttgttttcaa 


tatttgggta 


atattagaaa 


gattcttccc 


acagggaaga 


6000 


actagcaagt 


gtcccaattt 


tttccaaacg 


ttggggaggg 


gaaaattcac 


tgtatcatga 


6060 


aaccctaagg 


gtttgttgca 


cttcctgctt 


tttaggcctg 


gataacagta 


tcaccatcct 


6120 


tatttacaga 


agggtaaaac 


tgactcttaa 


tgagaaaagc 


tttataagtt 


caagggctgt 


6180 


aaaatatgaa 


ctacttaggg 


tcgtttgcct 


tccatgggaa 


cttggctaga 


cttagaaaaa 


6240 


gctgtttgtt


gtgctaatgt 


aaaagtgtca 


tacaatttag 


aagatttttg 


aagatggtaa 


6300 


acttagaaga 


attctatgtt 


ctgaaatgca 


cacttttaga 


atgtttttct 


ttgaaaacag 


6360 


gctaatagtt 


ctttcttttt 


ttgacaaagt 


ttcagctcct 


ctttaaagtt 


attgtgtcat 


6420 


ttttcbggtt 


taaatttccc 


ttatatttcc 


acctgtaatg 


tcagtggcaa 


caacatcata 


6480 
cttcacttac


ttgttaattt 


acuutttacc 


cttattctaa 


gagcaagttg agttgaactg 


6540 


aatcttcctt gtcttagtaa 


caatgtataa 


atagtggctt 


ttctgtacaa aaggttgtaa 


6600 


tgcctcctga 


tggatataat 


tttgtgattg 


tatttaaaag 


ttgaataaat 


cacaccagct 


6660 


tcctgaaaat 


gttcataatg 


catcttttgg 


aaaacaaata 


catgcctact 


ttgtgcatat 


6720 


ttgcattaac 


atggcaaaga 


ttgtatgaaa 


tacctgtttt 


tcagaaaata aaggttcagt 


6780 


ctcaaaagat 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaggcg 


gtc 




6823


<210> 17 

<211> 984 

<212> DNA 

<213> Homo sapien 












<220> 

<221> misc_feature
<222> (211)..(456)
<223> a, c, g or t 












<400> 17 
tggccctttg 


agatttccta 


ccxcaccgtt 


acttcagttt 


acccttgcag ggggccaggg 


60


agtcaagaat 


ataccgtgtt 


cctccagggt 


ttaagccggc 


catgccttcc 


cgagagcata 


120 


accaacttga 


caggggtgcc 


cagttacccc 


acaaactgaa 


ggaaggagat 


ccttcccccg 


180 


tccccaggag 


tgctctcaac 


cagcctcaga 




nnnnnnnnnn 


nnnnnnnnnn 


240 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


300


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 






nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 




nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnccct 


aaccggccta 


ccacgacccc 


480


actgaacagc 


gcatgacgcc 


acgagaccgg 


aacgacaagg 


cggaccaaga 


cagaaatcag 


540


actcaaacac 


aacagaccaa 


gataaaccgg 


caaatccacc 


acagaaatcc 


agggcagaaa 


600


aaccgacagc 


aacagcacaa 


actcgtgaga 


ccaaaagagc 


gacacaaaag 


ggacaaaggg 


660


aaacacgaac 


agaaaagcca 


gcctatgacc 


agcacgacac 


aaccccaaac 


gccaccagag 




cacccagacc 


gagccataga 


caacaacacc 


gacagccata 


acgtacggac 


tgctgccgac 


780


aacccaaccc 


agcacaaata 


aagacgcacg 


gaaccccaca 


tgcagagcct 


ccactgaatg 


840 


gcgccacccg 


cacggacacg 


acagcaaccg 


ccgcaatcag 


accaagcact 


ccccaacaga 


900 


ccacggagag 


acccgcaaca 


agcggcagca 


caagacacag 


atcggacaac 


accaagaaga 


960 


taggagagag 


ccccaggtaa 


caac 








984 
<210> 18

<211> 186 

<212> DNA 

<213> Homo sapien 



<400> 18 

ggtcgcggcg aggtacaaac 


gtgctgtaat tataagtctg 


tggtgcagtt 


ccattttatg 


60 


ctacacatta aggcttttaa 


ttttgtgtaa aaggaatttt 


tggaatagtc 


ttcattgaga 


120 


ctctttttaa tcttatactt 


tctaatttag aattataaga 


cattcaattt 


gaattataga 


180 


aaaatt 








186 


<210> 19 

<211> 418 

<212> DNA 

<213> Homo sapien 










<400> 19 

gatttgacta tatagccatg 


gtccttaatc atgctgagcg 


gcgcagtgtg 


atggatcgcc 


60 


cgggcaggtc acattggaat 


ttgaatgcag tggccaggac 


agcagcttga 


taaaccacct 


120 


tataggtagg taaacaaacc


acgggattcc ctcggctggg


ctggtgatgg ggaggggcgc 


180 


agtggacagg aagcgggcag 


ggcctgagga ccctgtactt 


taatgggcca 


caacagggct 


240 


cagctcctct cagaacttct 


gctacaatca gagttaagac 


cagatacatg 


ctacctaagg 


300 


ccatgaactt gacaatatct 


gctccagaga agccctgagc 


cctcgcccta ggtccctatg 


360 


attataaacc caatggtaag 


cccaagtctc acctttctaa 


atattctgtt 


tatcagtt 


418 


<210> 20 
<211> 1811 
<212> DNA

<213> Homo sapien 










<400> 20 

atcccgctaa tctgggggcc 


caatctcccc ccaggggctc 


ttccaagaca 


ccgaggcttt 


60 


cgctcgtccc tgtttaacag 


cctgcggtgt cttttatata 


tacgctccct 


gggggcccgt 


120 


gtcaccacaa cgtgtggtag 


tgaaatagtg gcgcctctgg 


aggaaattcc 


ttgatggcgc 


180 


cggcctcttg tatgtgttag 


aagacgctct ttactatggt 


gtggtctagt 


actaattgct 


240 


gattcatcga tgcggccttt 


tttttttttt tgacggttcc 


tatataacgt 


ttatttctgg 


300 


aagttaaagt agatacagca 


atataccaaa aaaaaaaaaa 


aaaaaaaaaa 


agacaaaaaa 


360 


cctcacaata atataaattt 


ttacactatg aagtacacat 


tggaatttga 


atgcagtggc 


420 


caggacagca gcttataaac 


caccttatag gtaggtaagc 


aacccacggg 


attccctcgg 


480 
ctgggctggg


gatggggagg 


gggcagtgga 


tctttaatgg 


gccacaacag 


ggctcagctc 


agaccagata 


catgctacct 


aaggccatga 


gaaccctcgc 


cctaggtccc 


tatgatttaa 


taaatattct 


gtttatcagt 


taagcttatt 


gagctctgca 


tgacacatca 


actgtctcca 


cctagagaag 


gtcgtctccc 


ccttaattca 


attccacaga 


aactctgatc 


agctgaggca 


gcctctgagt 


cagaggcagc 


ccatccagca 


ggcgagctag


ccagccaagg


cgggcaggcg


gtggggcccc


agcagggagg


agtcaccagc


acgagggcag


actcaagttt 


atcagtaagc 


cattacaagt 


gccactggat 


atcaccaaga 


accagacacg 


tcgctggggc 


cccatagtgt 


ctcttcttcc 


atgaggtcct 


gagaccagga 


agacacgcaa 


ggagacatgc 


ctctagcagg 


caggaaggaa


gccagaatca 


gaagtatccc 


tagcctctga 


aacagcagat 


caagtccagg 


ctgcccagcc 


ttactcacta 


aaaggccaat 


aaaaaaagtc 


taaaatgctt 


agattctcct 


ggttgcagct 


tcagatgtct 


taagggtttg 


ttaccagcct 


gaaggaagct 


gaatgcttaa 


tttttttttt 


t 




<210> 21 

<211> 602 

<212> DNA 

<213> Homo sapien 




<400> 21 
cggccgccgg 


caggtacagg 


tttttttggt 


cttcatgaaa 


acaccattgt 


tactaacttg 


caagttatgg gacttgtcta 


aatgtgtcta 
caggaagcgg


gcagggcctg 


aggaccctgt 


540 


ctctcagaac 


ttttgctaca 


atcagagtta 


600 


acttgacaat 


atctgctcca 


gagaagccct 


660 


acccaatggt 


aagcccaagt 


ctcacctttc 


720 


atgtactgaa 


gagtgttgct 


ggagaagtaa 


780 


ggcaggagga 


ctggggcgaa 


ccctgttcat 


840 


gagaccagct 


aatttgattt 


aacaaacaga 


900 


aggtggtttt 


gagttgccaa 


atccggccat 


960 


cgtgctaggt 


gttcccatac 


gcacaggaga 


1020 


gggaggccct 


ctagctgttc 


tgcctcacct 


1080 


ctcagaggga


ggccaggtat 


acaccctcat 


1140 


ctttgccctg 


catgaactga 


atgaagacgc 


1200 


aactggctaa 


gaaccatatt 


cctgagctca 


1260 


gcatcatgtc 


caacctgtaa 


ctctctcccc 


1320 


ttcctaaaac 


aaacaggatg 


agggaccttt 


1380 


atcaggggga 


ctggggtcgg 


gagggtgggg 


1440 


agccctgata 


aggcacccac 


agaaacaacc 


1500 


gagaaaggga 


gtcaaggttg 


taagcaccct 


1560 


acattacaaa 


ggaaaataag 


tctatttata 


1620 


taaaccttcc 


tatttcaaca 


ccaaaggcca 


1680 


acctgaagcc 


cgtttcaggg 


attatataaa 


1740 


agcaccaagg 


aggctgttaa 


ctgaagtttc 


1800 








1811 


aatgcgattt 


aaaattttta 


aaaacatggg 


60 


ggcttagatg 


ggaatctaat 


gtgaatagtt 


120 


tactctaaac 


ctgggggaag 


gtggtgtgga 


180 
acattgactt gactacagtc gaatacatgt tgggaatcat tcccaatcat aagacatcgt


240 


cgtttctgta ggatgactgc atatgttcag agtagctttt tgaatttggt tcccttagct 


300 


ttagggatgt gatgttatag tccaaaatgt ttacaagaaa acctaagtct tcaaaagcac 


360 


aacttttgtt gcttaggact ttgcatcaac tgttgttcca gacctcactt caacttcttg 


420 


gtctctgaac 


tggttttagc tagcatgcat gagagacagt tttcatgtat gctagctaaa 


480 


accagttcag 


agacagtttt catgtataat gtttctgctc tgcaactgga aagtggttta 


540 


ctttgcacaa 
aa 


atgctgatga aaacttcatg actatgagac ttttcttctg tgtatcaaga 


600 
602 


<210> 22 

<211> 4114 

<212> DNA 

<213> Homo sapien 




<400> 22 
tttttttttt 


aattaggata atgcctttat taacgaaaat gaaacgttca ttcctccttc 


60 


cactcctttt 


cgttggtttt ttggacacag ctcacctgat cctgctaaaa acgttgtcag 


120 


tctgcttggg 


gcttccctcc ttgattgact cacgccgggg gacgtcccga aaagtactta 


180 


tccacttcat 


gggaatgagc cctccaatat cagccaacat caatcattct tacctaaaga 


240 


ataataagaa 


aaagttaata taaaagacaa gggtataaaa taaaggtttg aaaatgctag 


300 


tcaacttcaa 


aatttaaaga gtaaaaatcc agagataaag attgggggta agttacagca 


360 


taaaaaaata 


ggaagaaact tcatggtggg ggggaaatct aaaattattc ttacataaaa 


420 


taagtagaca 


cctgaattag aatgaaaact gtattttctt taaaatgtaa aagcctgact 


480 


ctcagtttca 


ccagtctgag cacaagtttg actgcaaccc aaaatatact atcccttatg 


540 


tgaaggtatg 


tgacaacgtt gacctcacca aatgagtttt aacatcagct cttttttcat 


600 


atgaaagcac 


ataccctgct ccccattcaa gtatgtcttc cattgtcagg caggctgacc 


660 


accttcagca 


ggagtcctcc aagagtgccc aactcccctt cccacagtac acaacgctgt 


720 


agttgttgtc 


ctgcaatcct ttgtatttac ctcattcttt cccatctaag tcctcactga 


780 


gttttaaagt 


tagggctgga aaagctatgc cttactggga cagcaaggaa ccaatttttt 


840 


tctgagggag 


aagacattca ccttcactat atgcctggca gggccacagt gcacaaaaca 


900 


aagatcagcc 


ttcattcaag ttccaggttt ttcttcctcc ctgaatgatt actgcaaagg 


960 


gtatatgaag 


taagagttcc ctgttgcaca tgtaccatcc ataagggata ctatatcgtt 


1020 


ttgcattctt 


ccccccattc tccacattgt cctatcttaa gtccaagccc ttttcactct 


1080 
caaaaaaaaa


aaaaaaatat 


ttttttcagc 


actggtgttc 


aaaagcaacg 


tttttatggt 


1140 


taatggttta 


ccagcaactg 


ttgagatttc 


cagttgagtc 


ttaaaaattg 


ccaatcatta 


1200 


tctagcagca 


atgacagatg 


attaggagca 


gtcaaatcct 


ctgaattctt 


tccctaatag 


1260 


gcagccattt 


gagaactgca 


ctagctgaca 


tcactaaaac 


attatcagct 


aaagccaaaa 


1320 


ccaaataaag 


gcccagacca 


acatcctggc 


tctctaaaac 


ctgtccaaaa 


tcattaagtg 


1380 


aaaggcagta 


aatgcaggac 


tgtggatcat 


gtcactgcag 


ctgacaatga 


ttaacaatag 


1440 


gagacatgca 


acccccatta 


aggttaaaag 


tccaaaacta 


gtcacacgca 


tctctttatt 


1500 


ggggaaaagt 


gagactatta 


tgcattcttg 


gtaggtttgc 


aaccttgcat 


gaagagcacc 


1560 


cattgcattt 


ctttcatctt 


tcagaaagca 


ccggtatctg 


ttccaagggc 


ctaacagtac 


1620 


gaaaatacat 


tctggcatca 


cacctctgaa 


cccaagactg 


ttctcattaa 


aaataatttt 


1680 


ggtttgtaac 


aaaattatga 


aatacaatgc 


aagcacctcg 


gtatagcatt 


attactgaaa 


1740 


ccacttaatt 


cccagctttt 


tgagtttttt 


aaaaaaaccc 


actgcactaa 


gattcacaat 


1800 


tcattgctac 


atacaaatta 


aagctagtaa 


gaacacacta 


acgtcacaag 


tttctcattc 


1860 


taaagtgcaa 


aagcctaatc 


atctgaaagt 


gaacagggta 


aggcaaaatt 


aaccccccac 


1920 


cccaataaag 


ttcctgaagt 


ccatatatta 


tataccaagt 


acattctcta 


aaaattgtta 


1980 


ctgactggta 


agaaatagac 


ctgagttttt 


atttctaaca 


cccaatcact 


aaaccacggc 


2040 


agcaagcact 


ggccaccgat 


ttaatggatt 


acgacacagg 


aaaccccatc 


agggttctat 


2100 


gtaatttagt 


gatactcatg 


tcactaatat 


tgagcattat 


acttgatctg 


cattatattg 


2160 


ttgatatgca 


gaggctaaac 


tagtcatcat 


ttgctctttc 


atctatcagt 


agagtccaaa 


2220 


gttgtttgct 


tgaatggact 


acatgttaaa 


gtacaagtct 


gtccccacct 


tgtgaattgc 


2280 


ttgccaacga 


gcaagctttt 


tcttgataca 


cagaagaaaa 


gtctcatagt 


catgaagttt 


2340 


tcatcagcat 


ttatgcaaag 


taaaccactt 


tccagttaca 


gagcagaaac 


attatacatg 


2400 


aaaactgtct 


ctcatgcatg 


ctagctaaaa 


ccagttcaga 


gaccaagaag 


ttgaagtgag 


2460 


gtctggaaca 


acagttgatg 


caaagtccta 


agcaacaaaa 


gttgtgcttt 


tgaagactta 


2520 


ggttttcttg 


taaacatttt 


ggactataac 


atcacatccc 


taaagctaag 


ggaaccaaat 


2580 


tcaaaaagct 


actctgaaca 


tatgcagtca


tcctacagaa 


acyacyaty c 






ggaatgattc 


ccaacatgta 


ttcgactgta 


gtcaagtcaa 


tgttccacac 


caccttcccc 


2700 


caggtttaga 


gtatagacac 


atttagacaa 


gtcccataac 


ttgaactatt 


cacattagat 


2760 


tcccatctaa 


gcccaagtta 


gtaacaatgg 


tgttttcatg 


aagcccatgt 


ttttaaaaat 


2820 


tttaaatcgc 


attacaaaaa 


aacctgtact 


tttagttcaa 


ctcaacttgt 


agaattacca 


2880 
agattgcata


atgaaattac 


tgatattgcc 


gatctatggg caggtcagtt tgctacaata 


2940 


gagactaatt 


atcacatgct 


atacggtcca 


tgtcaaggtg ctaaaagcac cctagttccc 


3000 


aagtatagtt 


tagttccctc 


tcccccacac 


cactgatgtg ttccatgtta tcttcagtta 


3060 


caatgcaact 


aaaggaaacc 


acaactgagt 


cagatatacc aaagaatcaa gttgcacttt 


3120 


ttatctgaga 


actgcaacag 


cactgaattc 


tgcctgacaa attacagctc taaccccaca 


3180 


cccacacagt 


tttgatgtaa 


gctagcttta 


ccatacaagt gttaggtgct gcactgtaat 


3240 


ttcatgtcag 


aaatgtgatg 


ccagaatgcc 


cacggaataa aagtacatac aagtcaccaa 


3300 


gttagattat 


atgcttgtta 


cctacctgta 


tgcagtcggc aatgagaatc ttggagcaag 


3360 


caggaatact 


acatccgggt 


cctaatgtcc 


attgccattt gcggtactac gttcctcaca 


3420 


gttacgcact 


gcagaaatgc 


tggctaaatg 


cagttatgta gcaggccact actttaatag 


3480 


tgcataattg 


cagtccaaga 


acaccagaaa 


acattccgcc acaacttagt ggcttgccca 


3540 


agaaaagcca 


agtatctaaa 


ttttaatctg 


ccataatatg ccacttaaaa attgcacagg 


3600 


cgtaacatta 


caatttcccc 


attttttagc 


tgtttatatt agtggtacaa tacatctata 


3660 


aagagtggtg 


gggttaggtc 


tgtaatttgt 


caggcagaat tcagtgctgt tgcagttctc 


3720 


agataaaaag 


tgcaaacttg 


attctttggt 


atatctgact cagttgtgtg gtgtccttta 


3780 


gttgcattgt 


aactgaagat 


aacatggaac 


acatcagtgg tgtggggtta ggtctgtaat 


3840 


ttgtcaggca 


gagtgaggtt 


tgttgtggag 


ctggcagatc caaagttgga ggtgaaatgg 


3900 


tataaaaatg 


gtcaagaaat 


tcgacccagt 


accaaataca tctttgaaca caaaggatgc 


3960 


cagagaatcc 


tgtttatcaa 


taactgtcag 


atgacagatg attcagagta ttatgtgaca 


4020 


gccggtgatg 


cgaaatgttc 


cactgagctc 


ttcgtaagag agcctccatt tatggtgccg 


4080 


agcagctgga 


tagaaacccc 


cgctgattgt 


tgtt 


4114 


<210> 23 

<211> 234 

<212> DNA 

<213> Homo sapien 








<400> 23 
agtgtttgca 


acagcaccat 


ttgtcaaatt 


caaagatgct caaaaggtgt tccctacttt 


60 


gcatgagagg 


gagagctttg 


taacaggaaa 


ttgtataagg caaactctct attcattcct 


120 


aaggcctctg 


ttcattccta 


atgtttacat 


ggttctctac tctgaagggc accaacatgg 


180 


acctcacctt 


cttaacatgg 


aaaatcaaaa 


tctaaatgaa tacaattaaa agga 


234 
<210> 24

<211> 600 

<212> DNA 

<213> Homo sapien

<400> 24 

actgcattgg tggcttctcc 
atggcaggga ggatatctgg 
cggaacatgg taaacattta 
aaagagccaa tcctgcattg 
tgtctcttca gcttcccact 
gcgagagggt caatgtgcaa 
gacagccccc agcatcctgc 
tatctcctgg gatggttccc 
ctgaaggaca aggccccaat 
tcattccacc taggacccct 

<210> 25 

<211> 496 

<212> DNA 

<213> Homo sapien 

<400> 25 

ggcaggtaca ttgtaataat 
cattcatggc cctctgcaag 
atcctttgaa ggtctgagag 
cgccagccca gactgccatt 
agaccatccc ggagtcccca 
gaagggatct cctggcagct 
tgttgataag atggcagtgt 
gccattttct gtggtaggtt 
gctccaacac agggtg 

<210> 26 

<211> 1690 

<212> DNA 

<213> Homo sapien 






atagggaggg ggtcctatct 


catagactcc 


aaacttcatg 


60 


atttgctcac 


ccctgtattt 


tcagagcctc 


acacagtgcc 


120 


atactatctg 


tcaaaaacaa 


atgactgaaa 


gagcagatgg 


180 


aagagatatc 


catggtggcc 


ctaaaactac 


tcaaaccaga 


240 


ggaaaatgga 


ccggtgggca 


aaccaccatc 


tcacatctca 


300 


aagtatttaa 


atagcgacga 


ggtcccacac 


cccatcccag 


360 


tccaacataa 


gcagaaaaag 


acaatgaggt 


gcaaagagct 


420 


gggtggtctg cattgccaca 


cagcacatca 


tgctttgatc 


480 


aagccctatt 


tctcagcaca 


gtcttaaaat 


cacctggaaa 


540 


cgtgaatcag 


cactgatccc 


aagcgacacg 


ggaaccccta 


600 



aaccacaaaa taattgtata 


aggagaaaaa 


taataactga 


60 


gcacagtatc tgctttctca 


tttggtcctt 


cgttggttcc 


120 


cagcaggccc attccatgaa 


caccaaagcc 


cttcctacca 


180 


tcccctccag aaggccagtg 


ttcatcatcg 


atagggctag 


240 


tgtttcagga ctcccgaatc 


ttcaaatagt 


ccatctttaa 


300 


aagagggtga agctaatcgt 


tctacaaagt 


agtgcctaac 


360 


gtaggaagct gtgtgttggg 


tcggattccc 


cttttcatga 


420 


ccacggacat ggacaccatg 


gcctcatgga 


agcatgaaca 


480 








496 



<400> 26 

atgcctagca cggatgctgg tgcctcgcag ggtttacgaa tgatctgtcc cgtctccagt 



60 
gacgccctct


caggacccac 


taccgaagtc 


attaggactc 


ctttgtttgc 


agatgacagt 


120 


taccgagctc 


agataagagg 


tggcaaagag 


tgccagtccc 


ccatcaaggg 


tggacccaac 


180 


aggtggacag 


ctcagagagg 


cctggatgcc 


ggggatgggc 


tctcagcaga 


tgtaggagga 


240 


gctcaagtgc 


tggcaacagg 


caagacccct 


ggggctgaaa


ttgatttcaa 


gtacgccctc 


300 


atcgggactg 


ctgtgggtgt 


cgccatatct 


gctggcttcc 


tggccctgaa 


gatctgcatg 


360 


atcaggaggc 


acttatttga 


cgacgactct 


tccgacctga 


aaagcacacc 


tgggggcctc 


420 


agtgacacca 


tcccgctaaa 


gaagagagcc 


ccaaggcgaa 


accacaattt 


ctccaaaaga 


480 


gatgcacagg 


tgattgagct 


gtaggtgagc 


agtgacgtga 


agaggggttc 


tagccccgtg 


540 


gaaaacagcc 


catggttaac 


atctcaggat 


gttctgcatt 


caaacaccca 


aggctggtaa 


600 


tgaactttca 


catggactga 


atattggagg 


caaataatag 


aaggaataga 


atatacagtg 


660 


cctctgtcct 


gaaggaaaat 


atcatgcctc 


ttctggaaga 


aacggactgc 


acagaggaag 


720 


gattgagcaa 


tttagcctgc 


agtggaagaa 


ggtggacacc 


aaaagcttca 


ccctgtgttg 


780 


gagctgttca 


tgcttccatg 


aggccatggt 


gtccatgtcc 


gtggaaccta 


ccacagaaaa 


840 


tggctcatga 


aaaggggaat 


ccgacccaac 


acacagcttc 


ctacacactg 


ccatcttatc 


900 


aacagttagg 


cactactttg 


tagaacgatt 


agcttcaccc 


tcttagctgc 


caggagatcc 


960 


cttcttaaag 


atggactatg 


tgaagattcg 


ggagtcctga 


aacatgggga 


ctccgggatg 


1020 


gtctctagcc 


ctatcgatga 


tgaacactgg 


ccttctggag 


gggaaatggc 


agtctgggct 


1080 


ggcgtggtag 


gaagggcttt 


ggtgttcatg 


gaatgggcct 


gctgctctca 


gaccttcaaa 


1140 


ggatggaacc 


aacgaaggac 


caaatgagaa 


agcagatgct 


gtgccttgca 


gagggccatg 


1200 


aatgtcagtt 


attatttttc 


tccttataca 


attattttgt 


ggttattatt 


acaatgtaca 


1260 


tggctgttgc 


atagaagaca 


tgactggtgg 


aggctgagga 


aagccatgac 


attctacaat 


1320 


tgccatcagg 


ctaaggcccc 


gtgagcattt 


ctctcccttg 


taatattaac 


cctgtatttc 


1380 


tgggatcaca 


tcacggaata 


ttctttgcct 


ttccactttc 


caggaaatct 


ctcggactgg 


1440 


gctaccctcc 


ttgtgtgtga 


tgaaagatga 


gctatatttc 


agaacaaagt 


gctgtgttgt 


1500 


catgatttgc 


ctggactccc 


agggcgtctc 


ttacccaact 


tgataacgat 


gctgttcatt 


1560 


agcagccttt 


gttaactgat 


aaccaagagc 


ggtaatgtga 


tactcataag 


caattttctg 


1620 


tgtgtaggat 


aaaataaacc 


atcttgtatg 


ggaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1680 


aaaggcggtc 












1690 



<210> 27 
<211> 461
<212> DNA 
<213> Homo sapien 

<400> 27 



gggtgataat 


catataggcg 


aatggtctct agatgctgct cgagcggcgc 


agtgtgatgg 


60 


atgcgcccgg gcaggtacca 


agctcccagg cccttcctct atcatggatg 


ctgggtgact 


120 


tcgggaagtc 


accacctctt 


cccaagcctg tttcccatat cacagatgtg 


gggccatggc 


180 


ctcgatgatg gtctccacag 


gtctttccac ctctgtgagt ccaagtcagg 


tcaatcagca 


240 


aggacccatc 


tctgccctgg 


gtcagctcct cagaaccaac ccccagcatc 


cctaaagcaa 


300 


aagcctcacc 


tcaagggctg 


ctcagaagag agcaccttca gcatgagttg 


ttgctggaag 


360 


atctaataag 


ctgtgtttcc 


tgggaagtgg tgctttactt agccctgtgg


acaacttctc 


420 


tatgcatctg 


tgtgagcaga 


tgatcattgt attacctttt a 




461 


<210> 28 

<211> 4043 

<212> DNA 

<213> Homo sapien 








<400> 28 
ccgcttaatt 


aaagatcttt 


tttttttttt tttttttctc ataaacagga 


actttattaa 


60 


actacatgtt 


acataaaaga 


acatataaat ggaccattaa atacattcag 


tttattttaa 


120 


acaaatttac 


atagatactt 


atttacattt ctccattgta ttcttaaatt 


atttttccaa 


180 


gcttactacc 


gataaaaggt 


aatacaatga tcatctgctc acacagatgc 


atagagaagt 


240 


tgtccacagg 


gctaagtaaa 


gcaccacttc ccaggaaaca cagcttatta 


gatcttccag 


300 


caacaactca 


tgctgaaggt 


gctctcttct gagcagccct tgaggtgagg 


cttttgcttt 


360 


agagatgctg 


ggggttggtt 


ctgaggagct gacccagggc agagatgggt 


ccttgctgat 


420 


tgacctgact 


tggactcaca 


gaggtggaaa gacctgtgga gaccatcatc 


gaggccatgg 


480 


ccccacatct 


gtgatatggg 


aaacaggctt gggaagaggt ggtgacttcc 


caaagtcacc 


540 


cagcatccat 


gatagaggaa 


gggcctggga gcttggtacc cagggttcct 


ccaagaggtc 


600 


ccgatccctc 


tgctatccac 


aaatccacaa acttagaatc acttgcatcc 


attttccacc 


660 


accatggacc 


ttatgctttg 


aactgttttg tctacctgat taaatatata 


acttcttagg 


720 


cttcttgaga 


tggtgaaaag 


cagctcagga ttcccataag caactttgtg 


gagccctgga 


780 


aatacctact 


cagggctgtt 


tttacaagag gttttgtggt caggtgcttt 


actacttcag 


840 


ccataacgtt 


tacctttaaa 


actcagctga cttatggaag ctcagcattt 


ccaattcgct 


900 


tagatgacag 


gcaacagtct 


gcagaaggga ggttctaacg tcaaccacgt 


ggattcccca 


960 
caaacgcatc


atatttgcct 


aaatccatct 


atctaccaat 


gtcagatcta 


aaatgaggtt 


1020 


tcactaataa 


gtgacctaaa 


attaaaaaaa 


acacaaaaaa 


gtttcttaaa 


aaaaaaattc 


1080 


caagaatttc 


cccgtttccc 


aattagtctc 


agaggagtcg 


tgaaatgggc 


attgggtaga 


1140 


aaaagtccat 


gttttcaact 


gtttcttttg 


aataaagcct 


gattccccca 


ctctgccccc 


1200 


aaacttttcc 


cgatgatata 


cattcttctg 


aaagcaattt 


catgctttta 


gtctgcttta 


1260 


aaaaacctga 


cttggttctt 


ctcaggattt 


aaaaaccact 


gtaccaagga 


agagatgtcc 


1320 


acgttaaaaa 


ctattattaa 


gagacagaga 


caaggagaga 


gagactctct 


ccaggccatg 


1380 


ttagaaaaga 


aggaagtgct 


gggagcacac 


aaagagagaa 


gaaaggctgg 


gagacagaat 


1440 


ggtttgcacc 


tccctgtaac 


cagagtgaac 


acaggtcata 


gcagccttca 


ctcagtttta 


1500 


taacagacac 


acgaatggag 


gaacatggga 


ttggaaagac 


gcacactgtc 


aaacagccct 


1560 


gatgcctttc 


cactaaaata 


tggcacttct 


agattccaga 


gaaatccagc 


attctaccag 


1620 


gggctgccac 


ccatctgcag 


ggaagcctgc 


cctgcaacca 


tgaggccctc 


aaggctcata 


1680 


ctccactgct 


cacaatgcgc 


gccctgaata 


catggaagca 


tctgatttct 


gtctgagtcc 


1740 


aactttccct 


cctgttccta 


gacagactca 


cacccagagt 


catgcacgga 


gaagtgggca 


1800 


agccagacaa 


tcttcaggga 


gacacagccc 


atcacaggat 


acccaaggct 


tatggggaag 


1860 


tccagttgcc 


agcaacggac 


cctgtgtggg 


caggtgaaac 


tctgaagcca 


gagaacacag 


1920 


gaagataaaa 


atatcttcat 


actgaggata 


tactgcacaa 


gtgtggtggc 


tcacacctgt 


1980 


aatcccagca 


ccttgggagg 


ccaaagtggg 


ctgatagctt 


gagcctagga 


gtttgaaact 


2040 


agcctgggca 


acataatgag 


accctaactc 


tacaaaaaaa 


aaaaaaatac 


caaaaaaaaa 


2100 


aaaaaaatca 


gctgtgttgg 


tagtatgtgc 


ctgtagtccc 


agctatccag 


gaggctgaga 


2160 


tgggagatca 


cctgagccca 


caacctggag 


tcttgatcat 


gctactgaac 


tgtagcctgg 


2220 


gcaacagagg 


atagtgagat 


cctgtctcaa 


aaaaaaaaat 


taattaaaaa 


gccaggaaac 


2280 


aagacttagc 


tctaacatct 


aacatagctg 


acaaaggagt 


aatttgatgt 


ggaattcaac 


2340 


ctgatattta 


aaagttataa 


aatatctata 


attcacaatt 


tggggtaaga 


taaagcactt 


2400 


gcagtttcca 


aagattttac 


aagtttacct 


ctcatattta 


tttccttatt 


gtgtctattt 


2460 


tagagcacca 


aatatatact 


aaatggaatg 


gacaggggat 


tcagatatta 


ttttcaaagt 


2520 


gacattattt 


gctgttggtt 


aatatatgct 


ctttttgttt 


ctgtcaacca 


aaggatggac 


2580 


agtgattcag 


aaccgtcaag 


acggtagtgt 


tgactttggc 


aggaaatggg 


atccatataa 


2640 


acagggattt 


ggaaatgttg 


caaccaacac 


agatgggaag 


aattactgtg 


gcctaccagg 


2700 








taacgaacag 


gcatgcaaaa 


taaaatcatt 


ctatttgaaa 


tgggattttt 


tttaattaaa 


2760 


aaacattcat 


tgttggaagc 


ctgttttagg 


cagttaagag 


gagtttcctg 


acaaaaatgt 


2820


ggaagctaaa 


gataagggaa 


gaaaggcagt 


ttttagtttc 


ccaaaatttt 


atttttggtg 


2880 


agagatttta 


ttttgttttt 


cttttaggtg 


aatattggct 


tggaaatgat 


aaaattagcc 


2940 


agcttaccag 


gatgggaccc 


acagaacttt 


tgatagaaat 


ggaggactgg 


aaaggagaca 


3000 


aagtaaaggc 


tcactatgga 


ggattcactg 


tacagaatga 


agccaacaaa 


taccagatct 


3060 


cagtgaacaa 


atacagagga 


acagccggta 


atgccctcat 


ggatggagca 


tctcagctga 


3120 


tgggagaaaa 


caggaccatg 


accattcaca 


acggcatgtt 


cttcagcacg 


tatgacagag 


3180 


acaatgacgg 


ctggtatgtg 


tggcactctt 


tgctcctgct 


ttaaaaatca 


cactaatatc 


3240 


attactcaga 


atcattaaca 


atatttttaa 


tagctaccac 


ttcctgggca 


cttactgtca 


3300 


gccactgtcc 


taagctcttt 


atgcatcact 


cgaaagcatt 


tcaactataa 


ggtagacatt 


3360 


cttattctca 


ttttacagat 


gagatttaga 


gagattacgt 


gatttgtcca 


atgtcacaca 


3420 


actacccaga 


gataaaacta 


gaatttgagc 


acagttactt 


tctgaataat 


gagcatttag 


3480 


ataaatacct 


atatctctat 


attctaaagt 


gtgtgtgaaa 


actttcattt 


tcatttccag 


3540 


ggttctctga 


tactaagggt 


tgtaaaagct 


attattccag 


tataaagtaa 


caaacacagt 


3600 


ccctagatgg 


attgccacaa 


aggcccagtt 


atctctcttt 


cttgctatag 


ggcacaggag 


3660 


gtctttggtg 


tattagtgtg 


actctatgta 


tagcacccaa 


aggaaagact 


actgtgcaca 


3720 


cgagtgtagc 


agtcttttat 


gggtaatctg 


caaaacgtaa 


cttgaccacc 


gtagttctgt 


3780 


ttctaataac 


gccaaacaca 


ttttctttca 


ggttaacatc 


agatcccaga 


aaacagtgtt 


3840 


ctaaagaaga 


cggtggtgga 


tggtggtata 


atagatgtca 


tgcagccaat 


ccaaacggca 


3900 


gatactactg 


gggtggacag 


tacacctggg 


acatggcaaa 


gcatggcaca 


gatgatggtg 


3960 


tagtatggat 


gaattggaag 


gggtcatggt 


actcaatgag 


gaagatgagt 


atgaagatca 


4020 


ggcccttctt 


cccacagcaa 


tag 








4043 


<210> 29 

<211> 176 

<212> DNA 

<213> Homo sapien 












<400> 29 
ttataataac 


aaaaaaagtg 


tgaggggatg 


ttttcccaag 


cccccttctc 


cggtgggcgg 


60 


ctctcgtgtg 


aggacccatg 


gtgacagagt 


ctctctcatc 


tcctcactct 


gaaagcattc 


120 


cactggggag 


agtcaaccct 


ggctcggggc 


ttcctccaca 


cagcacacgg 


cccttc 


176 
<210> 30

<211> 1332 

<212> DNA 

<213> Homo sapien 

<220>

<221> misc_feature

<222> (623)..(642)

<223> a, c, g or t



<400> 30 


l_* i. 1— y d CH— "w I— 


c a t c e c crcr era 




ggcggcggcg 


y y ^ayaa 




cccaccatca 


acgccatcac 


m a n a 

yCCCaCay Ly 


atcacctcca 


tgtccaaccc 


g ccyyy cctg 


gcctctgtcg 


ctggacacat 


cattggcacc 


accgtgggcc 


gcaggaggag 


gaagcgtcgc 


atccggcggg


agaggaacaa 


ccgggagctg 


acagagaagc 


tgcaggcggt 


tgggtgggct 


ggagtgagag 


ccccgggggt 


acccacaaat 


gccctacaga 


tgagtcagtg 


gggtcagcat 


tgtgttgatt 


gannnnnnnn


cagggcccag 


agggaagaaa 


tgaacaacct 


cagcacctga 


gtgttctctg 


ctcacagaac 


gggtgggagc 


cagggcaggt 


ccagacctgc 


ccactcccat 


gcctgccgcc 


atcaggagtg 


gccagagaga 


agtcttgatg gagataggtc 


ttttctcatt 


tcctttatgg gcaggcgggt


ggccatggta 


ccgcgtgctg ctcttgtgag 


cctggggacc 


cgcagccctg cccttccctc 


actcccttca 


gagcctggcc ctgcaggaag 
cagcagcggc


tctcctgcgc 


acgccgagtc 


60 


attccgggta


gatatgcctg gctcaggcag


120 


gaccagccag 


gacctgcagt


ggatggtgca 


180 


ataccctcgc 


tcgcatccct 


acagccccct 


240 


ggccctccca 


agacctggcg 


tgatcaagac 


300 


agatgagcag 


ctgtctcctg 


aagaggagga 


360 


gctggctgca


gccaagtgcc 


ggaaccgacg 


420 


gaggaactct 


gcgtagggtg ggagcacctc 


480 


cctgatcctc 


tctcccacag 


ctgtctcctt 


540 


gagatagagc 


tttccttcct 


ttggacacat 


600 


nnnnnnnnnn


nngtggcgtt


tagcaccagc


660 


tctgtggggg 


tgctagccac 


ggtttttacg 


720 


tgtgggggct 


cctgggtgag 


ggatgatgga 


780 


cccatccagg 


gctgcttccc 


gaggttccag 


840 


gcccagcctg 


gcctaagcac 


tctgggttgg 


900 


acgtggcagc 


agggttgtat 


tttcttgggg 


960 


tccccacctc 


tatcatgaac 


tcagagctgt 


1020 


tcactctccc 


gtgccgccta 


attgcccaga


1080 


tgttaagctt 


gacctgacca 


acgtcacgta


1140 


ggccgtgtgc 


tgtgtggagg aagccccgag 


1200 
ccagggttga ctctccccag tggaatgctt tcagagtgag gagatgagag agactctgtc 1260

agtggtcctc acagtgactg cccaggatgg ggcttgggaa aacatcccct cacacttttt 1320
ttgttattat aa 1332 



<210> 31 

<211> 571 

<212> DNA 

<213> Homo sapien 








<400> 31 
gccgcccggg 


caggtacctg tagtcccagc 


tactgaggag gctgaagcag gaggatccct 


60 


cgagcacaaa 


agtttcaggc tgtggtaagc 


tatgactgtg 


ccactgcaca ccagcctgag 


120 


ttacagaggg 


agatcccaac tcttaaaaac 


taaaacaaca 


ataaatatat acaagaatca 


180 


taacataaag 


ggattcatgc ttagaaaaaa 


tccataaact 


cccttctaaa tattgagaca 


240 


ctccaggctt 


ctttcagaca aataacttct 


aattattcca 


tatttttcaa gttattaacc 


300 


aagataaaga 


atctctcagt tagtggggaa 


aatgaaaatt 


attaagaata gaattgtctt 


360 


ctgactttaa 


aaacaattta gactttaaaa 


catgaactgt 


ttactcaggc tggtgatact 


420 


ctagtttgtt 


agtataccat acttgaagat 


atcatcaaga 


tcactatagt tgtatatatt 


480 


ctctattttt 


atatgtaatg ttaacttagt 


tcaagtattt 


tttgcttgta tcgttaactg 


540 


atcatcaaat 


acaatcctaa agatatatca g 




571 



<210> 32 

<211> 240 

<212> DNA 

<213> Homo sapien 

<400> 32 

ccgggcaggt acatattcta cagaaaaaaa tcaaatatgt atacaaataa gtatgcacag 60 

gatttagaat cttacataaa aatgtattta acttggctgg agtgtgtgtg tgtgtttccc 120 

aggttaagta aaattagaaa gccagaatca caagccacaa aaaagaaaaa ctgataaatt 180 

agataccatc aaaattaaaa tctctaactg ctacaaataa taccatcaag aatgggaatt 240 



<210> 33 

<211> 1026 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature

<222> (883)..(883)

<223> a, c, g or t 
<400> 33

aatagctcga


ccgcgcgttt 


tgatggatga 


gcgacgaggt 


caggaggacc 


tgtatcacaa 


60 


aaaaaaagaa 


aagggaaaaa 


agaagagaaa 


gcagcagcat 


gatacctgac 


atgacagatg 


120 


tgggagaccc 


acagcctgca 


gacactgtgt 


ggctggaagg 


tggcgacggg 


agtggtgccg 


180 


gtggagtgtg 


gagctgtctg 


aaagtgacgc 


agcagacagt 


agaagcatag 


gtgggcgaag 


240 


cccaggtgac 


cctcaaaacg 


ttgcacaaga 


acatcaggga 


aaaagaacta 


gaatccttta 


300 


aggaaaatgt 


tcttcatgta 


tgagagacta 


aagtgatttt 


tctaagaaag 


ttcagccctt 


360 


ctctgactta 


cctggacatt 


tctagatact 


tccaaaggac 


cctctgggaa 


tccatagctt 


420 


cctaatctgg 


agatgggagg 


tcataaggga 


gacgctgtgg 


ggttccttga 


agtttcttgg 


480 


gttcacagag 


gagccccctc 


acttggtgtt 


ctcccgtgag 


ccagcctcca 


cctgccaaag 


540 


acactctggt 


cctcgtatag 


tgagtaatgg 


ggctcagggc 


ctctccaaca 


acagagagga 


600 


gctgatgctg 


tagggctgac 


cccgtgactt 


cctggagtcc 


tcaccctgtc 


cagtgctttg 


660 


agattcttcc 


cacctcccca 


tcctcaccag 


ccggatcggg 


cgctgtgcag 


tgtggtcagc 


720 


atggtgaaga 


aagtcatttc 


cttggtggac 


agtattcctc 


tttatctctc 


attacactgg 


780 


aaatgttatt 


tctgctgtat 


catccgtgct 


ccaacgtttc 


tagtctgtca 


ggctcacctt 


840 


ctctctggaa 


agaattgctt 


aacttgacat 


tccatgtgcc 


gcntataaaa 


tatattttga 


900 


aagaacaaaa 


aaaaccaaaa 


aaaaaaaaac 


ggtgggggta 


acctgggcca 


aaacgcgtcc 


960 


ccggggtgaa 


ttgttctccg 


ccccactcaa 


ccccaccaac 


aaaaaaataa 


acagacaaaa 


1020 


aacaaa 












1026 


<210> 34 
<211> 1545 
<212> DNA 
<213> Homo sapien 












<400> 34 

gccgacaatg acatcggagc 


cgtctcaacc 


acagggcatg 


gggaaagcat 


cctgaaggtg 


60 


aacctggcta gactcaccct 


gttccacata 


gaacaaggaa 


agacggtaga 


agaggctgcg 


120 


gacctatcgt tgggttatat 


gaagtcaagg 


gttaaaggtt 


taggtggcct 


catcgtggtt 


180 


agcaaaacag gagactgggt 


ggcaaagtgg 


acctccacct 


ccatgccctg 


ggcagccgcc 


240 


aaggacggca agctgcactt 


cggaattgat 


cctgacgata 


ctactatcac 


cgaccttccc 


300 


taagccgctg gaagattgta 


ttccagatgc 


tagcttagag 


gtcaagtaca 


gtctcctcat 


360 


gagacatagc 


ctaatcaatt 


agatctagaa 


ttggaaaaat 


tgtcccgtct 


gtcacttgtt 


420 
ttgttgcctt


aataagcatc 


tgaatgtttg 


gttgtggggc 


gggttctgaa 


gcgatgagag 


480 


aaatgcccgt 


attaggagga 


ttacttgagc 


cctggaggtc 


aaagctgagg 


tgagccatga 


540 


ttactccact 


gcactccagc 


ctgggcaaca 


gagccaggcc 


ctgtatcaaa 


aaaaaaagaa 


600 


aagggaaaaa 


agaaagaaag 


cagcagcatg 


atcctgacat 


gacagatgtg 


ggagacccac 


660 


agcctgcaga 


cactgtgggc 


tggaaggtgg 


gaagggaggg 


gccggtggag 


gtggagctgt 


720 


ttgaaagtga 


cacagcagca 


gtagaagcag 


tggtgggcga 


agcccaggtg 


accctcagaa 


780 


cgttgcacaa 


gaacatcagg 


gaaaagaacc 


agaatccttt 


aaggaaaatg 


ttcttcatgt 


840 


atgagagact 


aaagtgattt 


ttctaagaaa 


gttcagccct 


tctctgactt 


acctggacat 


900 


ttctagatac 


ttccaaagga 


ccctctggga 


atccatagct 


tcctaatctg 


gagatgggag 


960 


gtcataaggg 


agacgctgtg 


gggttccttg 


aagtttcttg 


ggttcacaga 


ggagccccct 


1020 


cacttggtgt 


tctcccgtga 


gccagcctcc 


acctgccaaa 


gacactctgg 


tcctcgtata 


1080 


gtgagtaatg 


gggctcaggg 


cctctccaac 


aacagagagg 


agctgatgct 


gtagggctga 


1140 


ccccgtgact 


tcctgagtcc 


tcaccctgtc 


cagtgctttg 


agattcttcc 


cacctcccca 


1200 


tcctcaccag 


ccggatcggg 


cgctgtgcag 


tgtggtcagc 


atggtgaaga 


aagtcatttc 


1260 


ctcggtgggc 


agtattcctc 


tttatctctc 


attacactgg 


aaatgttatt 


tctgctgtat 


1320 


catccgtgct 


caacgtttta 


gtctgtcagg 


ctcaccttct 


ctctggaaag 


aatttgctta 


1380 


acttgacatt 


ccatgtgccg 


ctaataaaat 


atattttgaa 


agaaaaaaaa 


aaaaaaaaaa 


1440 


aaaaaaaacg 


gtgggggtaa 


cctgggccaa 


aacgcgtccc 


cggggtgaat 


tgttctccgc 


1500 


cccactcaac 


cccaccaaca 


aaaaaataaa 


cagacaaaaa 


acaaa 




1545 



<210> 35 

<211> 338 

<212> DNA 

<213> Homo sapien 

<400> 35 



tgatcactca ctataggcct 


ggtgctctag 


atcatgctcg 


agcggcgcag 


tgtgatggat 


60 


ggccgcccgg gcaggtactc 


agaatggagg 


ggctggaagt 


ggggccagtg 


gtcctggtag 


120 


tcagcctccc cttctgacac 


aagaattatc 


gtggaacagc 


ttgtgtgacc 


gatcaactgg 


180 


tctctccatc tttaagccat 


tgtcttgttg 


actctgttac 


tgcagagttt 


ggggaggtct 


240 


catggcttct cgggatccta 


tttcctgagc 


ttccctgaag 


ccaacttctg 


gccactggta 


300 


tgtgatgcga gcacagtgcc 


tcctcctgct 


cctccagg 






338 



<210> 36 
<211> 1851

<212> DNA 

<213> Homo sapien 

<400> 36 



gccgcatatt 


tttttttttt 


tttttttttt 


tttttcaagg 


ttggaaaatc 


ccactttact 


60 


tcaagtgaag 


tcactgggtt 


gtccccatga 


aatatctcca 


caatgggtgt 


gctagacctt 


120 


tactctctgt 


ggcaccacca 


agaccccctc 


accctgccag 


agccctctga 


agtgtactca 


180 


gaatggaggg 


gctggaagtg 


gggccagtgg 


tcctggtagt 


cagcctcccc 


ttctgacaca 


240 


agaattatcg 


tggaacagct 


tgtgtgaccg 


atcaactggt 


ctctccatct 


ttaagccatt


300 


gtcttgttga 


ctctgttact 


gcagagtttg 


gggaggtctc 


atggcttctc 


gcatcctatt 


360 


tcctgagctt 


ccctgaagcc 


aacttctggc 


cactggtatg 


tgatgcgagc


acagtgcctc 


420 


ctcctgctcc 


tccaggaagc 


ttctgctgat


tgaacgcagg


ccagatggcg agatccggag


480 


caaccttctt 


agatcaaggg


aagaaagggg 


cacaagaggg 


gagtaggtaa 


caagataaaa 


540 


ggagctccct 


ccctgatgac 


tacagagcca


tcgtggcagc 


cctgggcctc 


catttcagac 


600 


gttcatcttg 


cctaagccac 


caccatcagg 


gtctcagtca 


gtcatcattc 


tcatttactc 


660 


gggtggctgg 


gtgagggcgg


aatactacct 


tccagctgtc 


tgagattaag 


cctaagccac 


720 


caccatcagg 


gtctcagtca 


gtcatcattc 


tcatttactc 


gggtggctgg gcgagggcgg


780 


aatactacct 


tccagctgtc 


tgagattaag 


cagaacagca 


gctaaagcag 


taacagcagg 


840 


tctccttcag 


cagcttgcca 


cagggaagag 


ggtcccgcgt 


gaaccgaact 


caacttccac 


900 


ctgcgcagag 


gtagctacca 


tttgcgtaga


ggtagctggg 


tctttaaggt 


tagagggaag 


960 


gcgcagggag 


ggagagacgg


cgggtggggg 


ttgaacaaag 


tggagattca 


caaaagcaga 


1020 


ctagggcggg 


cgacgtgatc


agatgacctg


tgcgggcggc


agcctcctgc 


cctcctcccc 


1080 


ttcgtgcgcc 


ggctggagcg 


aagagttctt 


ttgacagccg 


tgagcttccc 


cgccaggaac 


1140 


ttactggggc 


tgcatcaccc 


tagaaacgtg


gctttgggct 


gtggaaacgc 


tgcctctgtg 


1200 


gaagtctctc 


ctcgcggggg


tggacgggtc 


gctgcgcgcc 


cagcgttctt


ctgcggttct 


1260 


cacagcccgc 


cgccgccgcc 


gcctcgggga


cctttgcggg


gaggcctcag gtcagcgccg


1320 


cccttgcgat 


ggcgggagag


cagagcggcc 


ccagggcctc 


tgagctccgg


ccccggcacg 


1380 


tcccgccctc 


tcctccgctt 


gcgggagccg 


gggcgcctcg 


gagggaaacc 


ttccggacac 


1440 


aggccgggag 


aagaggggcc


gtggcgccat 


cgcgcagctt 


cctggtttcg gccatgtttt 


1500 


tttttttttg 


agacggagtt


tcgctctttt


tgcccaggct 


ggagggcaat 


ggcgccgtct 


1560 


cggctccctg 


cagccgccgc 


ctcccgggtt 


gaagggactc 


tccggcctca ggccgcggtg 


1620 
ttcttgtaaa


ttagaaccgg 


cetafccrcacc 

\* w w ^ r3 ZD 




ctcgcggctg 


c oo acr acracra 


1680 


cccagcaggg 


ggcggggacc 


ccgggaggcg 


gggcctggcc 


gagctgcgca 


cacccgggtg 


1740 


gttctggcta cgggtacccg 


gacctgccca 


acgaccgttc 


tccattcccg 


acctccgcac 


1800 


cctacccggt 


ccgcagggca 


ccttcctcct 


ctctctcagg 


cacgtcctcg 


a 


1851 


<210> 37 

<211> 409 

<212> DNA 

<213> Homo sapien 












<400> 37 
gagagcaagt 


tcaagcaggg 


ccaggaacag 


gacagccggc 


aggagagcag 


gctcaacgag 


60 


gactttctgg gaatgctggt 


ccacaccagg 


tccctgctga 


aggagacact 


ggacatctct 


120 


gtgggactca ggaacaaata 


cgagctgctg 


gccctcacca 


ttaggagcca 


tgggacccga 


180 


ctaggtcggc 


tgaaaaatga 


LLu^L'l* C UU.CI 


y ia ^ c'-y y *-y y 




cl i>v^ w way CLO.CL 


240 


gagggaatca aatcagcccc 


gttttggagg 


gtgggggaca 


gaagatgggg 


ctacatttcc 


300 


cccataccta 


ctattttttt 


atatcccgat 


ttgcactttg 


agaatacatc 


taaggtcatc 


360 


tttcaaaaga gaaaaattgg 


acacttgagt 


gacttgtttt 


tagtttgtt 




409 


<210> 38 
<211> 2112 
<212> DNA 
<213> Homo sapien 












<400> 38 
ttgagagcag 


agagatcctc 


aggatatctt 


tagccaaagg 


aaaagctccg 


cattcccacc 


60 


cagtccagaa 


attgaaatac 


tatcaggggg 


caagagcctt 


tctctccagc 


tacacactcc 


120 


atctcccggg 


agcaagggga 


aactccgaga 


ggagggcaac 


agagccagca 


tcttgccagg 


180 


gccccggagg 


aggggttccc 


cgctacgcct 


gtgccggagg 


agttccagtc 


accgagcgag 


240 


gggcgcaagg 


gtgggtgcat 


cctgcgctgc 


ggcgggcgcg 


ctacccagac 


gctggtgtgc 


300 


agagccacat 


gaagcctgct 


ggggactggg 


ggccagggag 


cagcaagcca 


gctgggactg 


360 


aggcggacgc 


tgtctcaggg 


agacgctgac 


tcgcaaagac 


actcccttcc 


ttgtgcctgg 


420 


gtaaaaagtc 


tcctcctggg 


gtccctggcc 


atcctgaata 


tccagaatgg 


tgtttctgaa 


480 


gttcttctgc 


atgagtttct 


tctgccacct 


gtgtcaaggc 


tacttcgatg 


gccccctcta 


540 


cccagagatg 


tccaatggga 


ctctgcacca 


ctacttcgtg 


cccgatgggg 


actatgagga 


600 


gaacgatgac 


cccgagaagt 


gccagctgct 


cttcagggtg 


agtgaccaca 


ggcgctgctc 


660 


ccagggggag 


gggagccagg 


ttggcagcct 


gctgagcctc 


accctgcggg 


aggagttcac 


720 
cgtgctgggc


cgccaggtgg 


aggatgctgg 


gcgcgtgctg 


gagggcatca 


gcaaaagcat 


780 


ctcctacgac 


ctagacgggg 


aagagagcta 


tggcaagtac 


ctgcggcggg 


agtcccacca 


840 


gatcggggat 


gcctactcca 


actcggacaa 


atccctcact 


gagctggaga 


gcaagttcaa 


900 


gcagggccag 


gaacaggaca 


gccggcagga 


gagcaggctc 


aacgaggact 


ttctgggaat 


960 


gctggtccac 


accaggtccc 


tgctgaagga 


gacactggac 


atctctgtgg 


ggctcaggga 


1020 


caaatacgag 


ctgctggccc 


tcaccattag 


gagccatggg 


acccgactag 


gtcggctgaa 


1080 


aaatgattat 


cttaaagtat 


aggtggaagg 


atacaaatgc 


tagaaagagg 


gaatcaaatc 


1140 


agccccgttt 


tggagggtgg 


gggacagaag 


atggggctac 


atttccccca 


tacctactat 


1200 


ttttttatat 


cccgatttgc 


actttgagaa 


tacatctaag 


gtcatctttc 


aaaagagaaa 


1260 


aattggacac 


ttgagtgact 


ttgtttttag 


ttttgttttt 


gtacattatt 


tatgtgattg 


1320 


ttatggaatt 


gtcacctgga 


aagaacaatt 


ttaagcaatg 


tcatttctag 


atgggtttct 


1380 


aattctgcag 


agacacccgt 


ttcagccaca 


tctaaaagag 


cacagtttat 


gtggtgcgga 


1440 


attaaacttc 


cccatcctgc 


agattatgtg 


gaaataccca 


aagataatag 


tgcatagctc 


1500 


ctttcagcct 


ctagccttca 


ctcctgggct 


ccaaaagcta 


tcccagttgc 


ctgtttttca 


1560 


aatgaggttc 


aaggtgctgc 


tttgcatgcc 


tgccaaccca 


tggaagttgt 


ttcttacttc 


1620 


ttttctctct 


tatttattaa 


ccatggtctg 


agagttgttt 


ttgttctatg 


taacagtatt 


1680 


gccacaaaac 


tataggcaaa 


tcgtgtttgc 


agggagattt 


ctgatgcctc 


tgtgggtgtg 


1740 


tgtaagttaa 


agtggccaca 


tttaagaagg 


ccaagctttg 


tagtggttgc 


acagtcacac 


1800 


tgatatgctg 


atttgctctt 


tctcattgta 


tgtctatgct 


ttgtcatcag 


tgctatagta 


1860 


aattacaaag 


aaataggtag 


attgtatgaa 


catacccaca 


aatgcctatg 


atttaggtta 


1920 


ccaatgtatt 


ctttctcatt 


tggggttttg 


cttctgtctg 


tctgtttatt 


ggaaacttgt 


1980 


acttcaagta 


gggggaatcc 


taattctaat 


aactccttag 


ctaagtttta 


ttattcaggc 


2040 


aataaacatg 


ttttcatgta 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


2100 


aaaaagatcg 


gc 










2112 



<210> 39 

<211> 713 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (260)..(539)

<223> a, c, g or t 
<400> 39

ccgcccgggc aggtacctac ctgatgaagc tgttcatgct tccagcatca aactggacgt 60 

ttactgtctg cccaaaagcc agtttccaaa aggtttgctt ccctctgttc agtggattct 120 

tgactacata taggtcatat atttcaaaaa ataatgccta gctatttcta ctttgaaatc 180 

atgactaaag ccaaaccaca accacagcaa agataaccta aggatttgtt taccagaaat 240 

acctacaaaa aagtttgcan nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc 540

caccctgcca aaaagtttgc catggtccgg caggcaaagt tgcaaccgga gccgttcctg 600 

ggtgaaaagc ttggcggtaa tccttgtcct atccttgttc ccttgtgtgg acatttgttt 660 

tccccgctcc ccaatctccc ccataccctc ccacaaaaaa aaaccaggac aac 713 

<210> 40 

<211> 338 

<212> DNA 

<213> Homo sapien 

<400> 40 

ctgatttaaa gtcatagaat ttactgaaat acctacacag ctctgaatcc aacatttttg 60 

tcatggttat tagaagaaac caaaacaact gactgaaaat gaaatgttca gtcagacagt 120

gtttgtccac atcttttttt ttttgaagtt ttgaatcatc cattcacaga aaacacttgt 180 

ttatgggtat aatgcaaaac ttttgaaaca aaaaaaccta ctaaaaatgc tttcgctaaa 240 

gtgattggct tttcattcat gctttgaaat aaaattatct agaaaggttg gagaagggtt 300 

tgccgaacag ccatcttcct gatgtgcctt agattaga 338



<210> 41 

<211> 805 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (241)..(520)

<223> a, c, g or t 



<400> 41 



WO 02/068633 

aatgctgctc gagcgcgcgc agttatgatg gatgtggtcg cggccgaggt ctcagtttcc 60

ctatttgtaa aatggaaata atggtagctg cctcaaacct cattacgaat tcaatgagtt 120

aaacttgaaa agacttatta cagtagctgg cacataaaga cttgatagta gtttatatgg 180

atgctatctc atattagcat atatggaatt aatagtgtat cactatactt ttttttcttt 240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ttactatgct atcaattcca 540

tatatggaac ggtgtcttcc ctttcagagc tctttaaagc tctgcagaag atttacatgt 600

gtttatagag ctaagagaaa tcgggccatg gaaatgtggg tgtgtaacta acaaatttaa 660

cctcttcatg ttatattaat catgcataat tctatcttca tcctctgaaa gtggcctaga 720

gcaaccatca cagtaggaag cttgggtatc atggccatag ctggtcctgg tgtgaattgt 780

ttcgtcaatc caaaaaaaaa aaaag 805

<210> 42

<211> 300

<212> DNA

<213> Homo sapien

<400> 42

gaatgtatga tcactatagg gccatgggtt atctagatgc tgctcgagcg cgcgcagtgt 60

gatggatcgt ggtcgcggcg aggtacccct tgagcctggg caaggtgcat gacacacact 120

tggtcactgg aggaaatgat gagtagagag tgacaagcag aggaggggag ctttgggcct 180

gagtcctctg gggcacaaga gtggttgagg cttggcactt gccacctaga tttcagacaa 240

tgtgtcagga atcctggatg cccagaaaga aacctgctca gggtctggag ccccaacata 300

<210> 43

<211> 561

<212> DNA

<213> Homo sapien

<400> 43

ggactgagaa ttcctgtggt ctcctgagat cagtttcttt ccaccagatt gtaaacaagc 60

tgtggaaagt tttagttctt agcgactcca cgacccacac tagtcaggca gaagccccca 120

tgtacaaggt accccttgag ccttggcaag gtgcatgaca cacacttggt cactggagga 180

aatgatgagt agagagtgac aagcagagga ggggagcttt gggcctgagt cctctggggc 240
acaagagtgg


ttgaggcttg 


gcacttgcca cctagatttc agacaatgtg tcaggaatcc 


300 


tggatgccca gaaagaaacc 


tgctcagggt ctggagcccc aacataaagc ttfctatttag 


360


gcaatgttga gcactaatat 


gtggttggag ctctgcaggg ggtctccatg ggtgcactgt 


420 


gtagcaaagc 


tgtgggagaa 


aggccttacc cctcaaaatt ccagaattat aaatctatca 


480 


gccgcttgca acttcagcct 


ggaaaagcca tgggcattaa actccaaact gtgagagcag 


540 


ccatgtggct 


gcaccctcaa 


g 


561 


<210> 44 

<211> 530 

<212> DNA 

<213> Homo sapien 






<220> 

<221> misc_feature
<222> (102)..(182)
<223> a, c, g or t 






<400> 44 
gttttaaaaa 


gaggagctcc 


cctgcatgag atctcacttt ttgcctgcta ccatttatgt 


60 


aagatgtgac 


ttgctccttc 


ttgccttcca tcatgactgt gnnnnnnnnn nnnnnnnnnn 


120 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 


180 


nngcccgccg 


cgggcgtctc 


cacccgcctc tccgccgccg ccccaccccc gccgccccgc 


240 


gcgccacgag 


cagatcggag 


cagcaggccg cgaggagcag gcgagcggcc gccagtaggc 


300 


gcgacgcgtg 


ccgcggagac 


gacgcccggg gcgtgcacgc cgctgaggcc gaggacagcg 


360 


atgcgccgcg 


cgaggcgcgg 


agcgaggagg agcggcgtgg accgccccag cggccgagga 


420 


cgggcgagcg 


gcgcggggag 


gcccgaaggc gaggaggagg tgcgggacgg ggggacgaga 


480 


agaggagaag 


agaggggaaa 


gggag<39g99 ggagcggaga agagggaggc 


530 


<210> 45 

<211> 709 

<212> DNA 

<213> Homo sapien 






<400> 45 
acaaaatagt 


aatgatagcc 


atggaagttt taccttattc tgtgagaagt gttcttaaac 


60 


ttattaagtg 


tctaaactaa 


ggtttagtgc ttttttaaag gaaagttgtc ccaggattca 


120 


tcctaaagaa agcaaaagtt 


aattcaactg atccaccaat ggaattagat gggtagagtt 


180 


gggttcttga gttttaccac 


cacttagttc ccgctgaatt ttgtaacttc ctgtgtttgc 


240 
atcctctgtt


cctattctgc 


ccttgctctg 


tgtcatctca 


gtcatttgac 


ttagaaagtg 


300 


cccttcaaaa 


ggaccctgtt 


cactgctgca 


cttttcaatg 


aattaaaatt 


tatttctgtt 


360 


ctagctggga 


acacaaacaa 


caactcaact 


acaaaatcac 


gctctgtggg 


cagtatccct 


420 


tgcgccctta gcctttcccc 


cgggtgtcac 


tttgtcttcg 


cccccccatt 


cccccccact 


480 


ctcccacact 


ctccctcctc 


ccatcacttc 


cctccctctc 


tcccgctctc 


ocfccrccaacc 


540 


cgtcagcgcg 


cgtcgcgctc 


gcgccgtgcg 


gctcgcctcg 


ccgtctctcc 


ctcctctccg 


600 


ccgcgtcccc 


gccgccgcgc 


gtctgcccgt 


ccctcgtcct 


ccgctcccct 


cctccgcccg 


660 


ccgcctcccc 


ccgcccgtct 


gccactcgct 


ccctagtgcg 


cgcgcgccc 




709 


<210> 46 
<211> 1808 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature
<222> (153)..(153)
<223> a, c, g or t 












<400> 46 
tcataagagt 


gaaatagtca 


gctgctgacg 


gcacctcagc 


cacgccacty 


ttactcagtt 


60 


cagtgggtgt 


gcttgcgtgg 


taggatgtgg 


tgcagccctc 


tctacgctct 


tctatttttg 


120 


gtatatttyc 


tatttaacct 


tcaaatagct 


tcnaattctt 


tttttcttgg 


actggcttca 


180 


ttctgaattt 


gtgctaaaat 


aatctttcat 


aaagagacct 


cagtttatag 


cgtaacagac 


240 


tacacaatgc 


actgatgttt 


tcataatgtt 


taagggaccc 


actgcaagaa ggttgctgcc 


300 


tccttttaat 


tgtattcatt 


tagattttga 


ttttccatgt 


taagaaggtg aggtccatgt 


360 


tggtgccctt 


cagagtagag 


aaccatgtaa 


acattaggaa 


tgaacagagg 


ccttaggaat 


420 


gaatagagag 


tttgccttat 


acaatttcct 


gttacaaagc 


tctccctctc 


atgcaaagta 


480 


gggaacacct 


tttgagcatc 


tttgaatttg 


acaaatggtg 


ctgttgcaaa 


cacttttttt 


540 


ttgagatgaa 


gtctcgcggt 


tgtcacccgg 


gctggagtgc 


agtggcgtga 


tctcggctca 


600 


ctgcaacttc 


cacctcctgg 


gttccagcag 


ttctcctgcc 


tcagcctccc 


aagtagctga 


660 


gattacaggc 


gcctgccacc 


ccacctggct 


gatttttgta 


attttagtag agacggggtt 


720 


tcaccatgtt 


ggccaggctg 


attaactcct 


gacctcaggt 


gatccacctt 


tctcggcctc 


780 


ccaaagtgct 


gggattacgg 


gtgtgagcca 


ccgtgcccgg 


cctgcaaaca 


cattttaatt 


840 


gacaacacta 


gggctgttgt 


acaaaatagt 


aatgatagcc 


atggaagttt 


taccttattc 


900 
tgtgagaagt


gttcttaaac 


ttattaagtg 


tctaaactaa 


ggtttagtgc


ttttttaaag


960


gaaagttgtc 


ccaggattca 


tcctaaagaa 


agcaaaagtt 


aacccaactg


atccaccaat


1020


ggaattagat 


gggtagagtt 


gggttcttga 


gttttaccac 


cacttagttc 


ccactgaatt 


1080


ttgtaacttc 


ctgtgtttgc 


atcctctgtt 


cctattctgc 


ccttgctctg 


tgtcatctca 


1140 


gtcatttgac 


ttagaaagtg 


cccttcaaaa 


ggaccctgtt 


cactgctgca 


cttttcaatg 


1200 


aattaaaatt 


tatttctgtt 


ctagtgggaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1260 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1320 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


actgagaaga 


gggggtgccg 


1380 


tgcttgtcct 


tctccccacg 


ggcgggcgca 


gcgcgtgcac 


cacccccgtg 


gtgttaagtt 


1440 


aaaagggggc 


agcgggcccg 


ctcaggcggg 


tgtatataag 


agagagagag 


aggggcggct 


1500 


tacacacaaa 


atagatctgg 


cggcagaagg 


aggcgaataa 


aaaagttgct 


ttcagaagtc 


1560 


ctcatgatac 


ttgggctgaa 


gaagacaaga 


acatcctttt 


cagaatgcgg 


atgtagagat 


1620 


aatatcaatc 


gaaggagcca 


caccacccta 


agtagaaaat 


tagtcagggt 


cctcagtcca 


1680 


tattgagcag 


taaaaatatt 


caatgattca 


gaagtcgtta 


cggatctaca 


acagcaagca 


1740 


tgatgttact 


tcctgccgcg 


gaagactgag 


actttgctgc 


tcgagtatct 


ataagacaaa 


1800 


taatatcg 












1808 


<210> 47 

<211> 760 

<212> DNA 

<213> Homo sapien 












<400> 47 
aatgctgctc 


gagccgcgcg 


ttatgtgatg 


gatgccgccg 


ggcaggtact 


gggtcccgag 


60 


cgtggtgggt 


atttgccaca 


ctgggtggcc 


agctcagcag 


ccccccacct 


ctctttattc 


120 


tctccaaagc 


tggtctttct 


gactatcatt 


gtggtagggg 


gaggacagat 


gctaaaggtg 


180 


gaagctgacc 


tggagaaaga 


gacacacggg 


gtgactgtgg 


caaaggacag 


ctggaaaaga 


240 


aactctatca 


cttcttcatt 


ggcaaccaca 


aggcacccga 


ggccatggca 


ctcccagagg 


300 


ctgtgcgcag 


tagccaagcc 


tctcaacctc 


ttctggccct 


gcgtcctgca 


gcgaagtctc 


360 


tgctgtaaga cagtagactc 


cttcgatgag 


gtgctcaaaa 


atgctacccg 


gggtggtggt 


420 


gtctggcttg cagtctggcc 


cagttcagag 


aaagttgcag 


agatcagggg 


ccaaggatgt 


480 


catagcccca 


ggttgtcctc 


agggtcccaa 


tcctagggca 


gggtgtgcat 


ggaagcaaga 


540 


actatggaaa 


cctagctcca 


gtctgcaggc 


tctgagcccc 


tagttcctca 


ctccagcggg 


600 
gctccctcac


tgcacagaac 


ccaccccttc 


tgtgtgggca 


ctgctgacca 


cacagatgac 


660 


ccagacccaa 


agagcctggc 


agaagctctg 


tgcgttggag 


ctgggctccg 


tcctccggtc 


720 


tggttcaggg ggatcaggaa 


ggctcttttc 


acctgtggct 






760 


<210> 48
<211> 4940 
<212> DNA 
<213> Homo sapien 












<400> 48 
tgaagttcag 


tagagccggc 


cgctgcaggc 


cagaaggtgg 


gagccagcgg 


gggcatcgcc 


60 


gcccgcgccc 


ctctaagtgc 


cgggccgcaa 


gctccaccgc 


agccgcctgc 


aagcagcggc 


120 


gcctcggccc 


tcgacctgcg 


cgcaaagcct 


gtgctggagc 


cgtcctcccg 


cggcggggac 


180 


cgggaccggg 


gacccaagcc 


aatcgaaagc 


tccaaccatg 


gccatggggc 


tcttccgcgt 


240 


gtgtctggtg 


gtggtgacgg 


ccatcatcaa 


ccacccgctg 


ctgttcccgc 


gggagaacgc 


300 


cacagtcccc 


gagaacgagg 


aggagatcat 


ccgcaagatg 


caggcgcacc 


aggagaagct 


360 


gcagctggag 


cagttgcgcc 


tggaggagga 


ggtggctcgg 


ctggcggccg 


aaaaggaggc 


420 


actggagcag 


gtggcggagg 


agggcaggca 


gcagaacgag 


acacgcgtgg 


cctgggacct 


480 


ctggagcacc 


ctctgcatga 


tcctcttcct 


gatgatcgag 


gtgtggcggc 


aggaccacca 


540 


ggaggggccc 


tcacctgagt 


gcctgggcgg 


tgaggaggat 


gagctgcctg 


ggctgggggg 


600 


cgcccccttg 


cagggcctca 


ccctgcccaa 


caaggccacg 


cttggccact 


tttatgagcg 


660 


ctgcatccgg 


ggggccacgg 


ccgatgcagc 


ccgtacccgg 


gagttcctgg 


aaggcttcgt 


720 


ggatgacttg 


ctggaagccc 


tgaggagcct 


ctgcaaccgg 


gacaccgaca 


tggaggtgga 


780 


ggacttcatt 


ggcgtggaca


gcatgtacga 


gaactggcag 


gtggacaggc 


cactgctgtg 


840 


ccaccttttc 


gtgcccttca 


caccccccga 


gccctaccgc 


ttccacccag 


agctctggtg 


900 


ctccggccgc 


tcagtgcccc 


tggatcgcca 


gggctacggc 


cagattaagg 


tggtccgcgc 


960 


cgatggggac 


acattgagct 


gcatctgcgg 


caagaccaag 


ctcggggaag 


acatgctgtg 


1020 


tctcctgcac 


ggcaggaaca 


gcatggcgcc 


tccctgcggc 


gacatggaga 


acctgctgtg 


1080 


tgccacagat 


tccctgtacc 


tggacacgat 


gcaggtcatg 


aagtggttcc 


agacggccct 


1140 


caccagagcc 


tggaagggca 


tcgcccacaa 


gtacgagttc 


gacctggcct 


ttggccagct 


1200 


ggacagcccg 


gggtccctga 


agatcaagtt 


ccgttcaggg 


aagttcatgc 


ccttcaacct 


1260 


gattcctgtg 


atccagtgtg 


atgactcgga 


cctgtacttt 


gtctcccacc 


ttcccaggga 


1320 


gccctctgag 


ggcaccccag 


cctccagcac 


agactggctc 


ctgtcctttg 


ctgtctatga 


1380 
gcgacacttc


ctcaggacga 


cactaaaggc 


actgcccgag 


ggcgcctgcc 


acctcagctg 


1440 


cctgcagata 


gcatccttcc 


tgctctccaa 


gcagagccgc 


ctgaccggtc 


ccagcgggct 


1500 


cagcagctac 


cacctgaaga 


cggccctact 


gcacctccta 


ctcctccggc 


aggccgccga 


1560 


ctggaaggcg 


gggcagctgg 


acgctcgtct 


gcacgagttg 


ctgtgcttcc 


tggagaagag 


1620 


cttgctccag 


aagaagctcc 


accacttctt 


catcggcaac 


cgcaaggtgc 


ctgaggccat 


1680 


gggactccct 


gaggccgtgc 


tcagggccga 


gcccctcaac 


ctcttccggc 


ccttcgtcct 


1740 


gcagcgaagc 


ctttaccgta 


agacactgga 


ctccttctat 


gagatgctca 


agaatgcccc 


1800 


agcgctcatt 


agcgagtatt 


ccctacatgt 


cccctcagac 


cagcctaccc 


caaaaagctg 


1860 


acgtctttta 


cagaatgtgg 


gatcctcgag 


ctaagatgag 


ggcatccctc 


acgttcacac 


1920 


ccctggtggc 


atctgccagc 


cctgttctgg 


ggacaaggcg 


ggctttcgtg 


ggagccatgc 


1980 


tcagcctgcc 


aggaagccaa 


gccctacagt 


gcagaggaaa 


cagaatttca 


acgggaagct 


2040 


ggtttgcttc 


ataccattgg 


gatctgctgg 


taaagctgtt 


atttgggttt 


agggactgat 


2100 


cccttgcagt 


ttacttctgg 


atcaccatga 


atggccaaga 


tggtggcaga 


acacgctgtg 


2160 


gaccctgagt 


tagagacaat 


gcaaatgttg 


gattgggtgt 


aattcttttt 


gaatcccaga 


2220 


tccagtctgt 


acttgaatat 


gagcagagga 


tctacaagaa 


tgctgacagg 


gaaccgtgtt 


2280 


aagacccagc 


acccctattc 


ccaggagctt 


ctggcctgac 


catctgcagc 


caaagcacta 


2340 


acagggacag 


atatgggaat 


gtccaccttt 


gatccgcatc 


ctgcacaata 


gtggtcccac 


2400 


catggctgcc 


acttttttat 


actatttgga 


gaaaagacct 


tgtataaatt 


cgaggcccga 


2460 


gtgactaacg 


tctctgtcac 


acggaaatgg 


gtacttggtg 


gcatagagaa 


acacaattag 


2520 


ccactttttc 


agctacactt 


ctcactcagc 


tgcaccctac 


acttctcact 


caggtgcacc 


2580 


cccttctgct 


gtcctttccc 


caacgtactg 


ggtcccgagc 


gtggtgggta 


tttgccacac 


2640 


tgggtgccag 


ctcagcagcc 


ccccacctct 


ctttattctc 


tccaaagctg 


gtctttctga 


2700 


ctatcattgt 


ggtaggggga 


ggacagatgc 


taaaggtgga 


agctgacctg 


gagaaagaga 


2760


cacacggggt


gactgtggca 


aaggacagct 


ggaaaagaaa 


ctctatcact 


tcttcattgg 


2820 


caaccacaag 


gcacctgagg


ccatggcact 


cccagaggct 


gtgcgcagag 


ccaagcctct 


2880 


caacctcttc 


tggccctgcg 


tcctgcagcg 


aagtctctgc 


tgtaagacag 


tagactcctt 


2940 


cgatgaggtg 


ctcaaaaatg 


ctacccgggg 


tggtggtgtc 


tggcttgcag 


tctggcccag 


3000 


ttcagagaaa 


gttgcagaga 


tcaggggcca 


aggatgtcat 


agccccaggt 


tgtcctcagg 


3060 


gtcccaatcc 


tagggcaggg 


tgtgcatgga 


agcaagaact 


atggaaacct 


agctccagtc 


3120 


tgcaggctct 


gagcccctag 


ttcctcactc 


cagcggggct 


ccctcactgc 


acagaaccca 


3180 
ccccttctgt


gtgggcactg 


ctgaccacac 


agatgaccca 


gacccaaaga 


gcctggcaga 


3240 


agctctgtgg 


ttggagctgg 


gctccgtctc 


caggtctggt 


tcagggggat 


caggaaggct 


3300 


cttttccacc 


tgtggcttca 


ctggcccttt 


gagatttcct 


atctcaccgt 


tacttcagtt 


3360 


acccttgcag 


ggggccaggg 


agtcaagaat 


ataccgtgtt 


cctccagggt 


ttaagccggc 


3420 


catgccttcc 


cgagagcata 


accaacttga 


caggggtgcc 


cagttacccc 


acaaactgaa 


3480 


ggaaggagat 


ccttccccca 


tccccaggag 


tgctctcaac 


cagcctcaga 


aagcttgaga 


3540 


agatggaccc 


tttgcccacc 


agggttaatt 


cctggtgggg 


cagctcggct 


gtgatcaggg 


3600 


caaccaaacc 


tataggaagc 


cttccagtgt 


gagctggaat 


tagactgaac 


atgtgcttgg 


3660 


gcctgcctct 


ccctagacgc 


agttgcgggg 


cactccaggg 


aatgaaccag 


ctcaagtgtg 


3720 


tccctaacag 


cagcctggag 


ctacccccaa 


tccctcacag 


cctgaccctc 


ctcattccat 


3780 


cagatgcatt 


tgtagaatcg 


gggcaaattt 


ccttatttat 


ttatggccca 


tgcctttccc 


3840 


cctcttcatc 


ctgatcccgt 


tttgctttga 


agagacccca 


gtaaccaaaa 


aacagcctcc 


3900 


agaagccaaa 


accatgcctg 


gatctcccat 


agcttctcct 


ttgcttccag 


gagaaagttc 


3960 


actgaaaaaa 


aaatatcttc 


tggcttcttg 


tgtgtacaga 


gacaacagaa 


ctcggtgggg 


4020 


aaacgggaat 


cttttctgca 


ccaaagctgc 


ttctaaagca 


gaaagcagtg 


gggctcttgg 


4080 


tgtttcatgc 


tgccttattt 


atattaaagg 


aagaattaaa 


tcttgcaagg 


agtaaaaatg 


4140 


gtcactgttt 


ggtttttaaa 


cttcaggaaa 


tctgtgaggc 


tcccagggca 


gagttgggga 


4200 


caggggggtg 


gatttcctat 


tgacaaagca 


gaagctttca 


ctcccttttt 


attcttacac 


4260 


tttcaggaga 


ctttaaaaaa 


taacaaaaca 


gaacatttct 


accctgtctt 


tgaggcaact 


4320 


tgtgtgctgc 


ccgcagggct 


ccagaaaggg 


cctttgaaag 


ctggctgtag 


ggtggattcc 


4380 


agcctaagac 


cttatttcac 


agagctaatg 


tcagcgagaa 


atataggagc 


tggaactaac 


4440 


aggaagcttt 


tgggttttaa 


agaaagagat 


tgtattttaa 


atagaggatt 


ttgagtcctc 


4500 


tagggcactt 


tggcctttcc 


actaaggaca 


accctgccct 


ctttccccag 


agtctctttc 


4560 


ccaaggatgc 


ttgactttcc 


ccagcaactg 


ctcacttgct 


ttgcaaaata 


acaagggtga 


4620 


aatgaataga 


gttggcccct 


aggccaagag 


gctggttatg 


ttggggaagg 


gccttatatc 


4680 


ctcaaaagct 


ctagcccctc 


tgcatgcctg 


gctgaagcac 


agccccgcgt 


tgagaacctg 


4740 


ggggtggagc 


tctggttcat 


catcaagggt 


gtcatccgaa 


tcacccagac 


aacctggtcc 


4800 


attgccccga 


ccccgccttg 


aacctacatg 


gtgctcagaa 


tccagtctgg 


agtggtcagg 


4860 


ctaggctcga 


tttttaaaga 


tttcccccaa 


gtgtctgatg 


tacccctagc 


tgaggaagag 


4920 
ctcagcttta


tggtaccgac 




<210> 49 

<211> 782 

<212> DNA 

<213> Homo sapien 




<400> 49 
accattctga 


acgatgttaa 


aacaaatatg


aaatatgaac 


ttacaaggag 


gggcactcat


tgatgaaata 


agccaaaata 


atcctaaaat 


ataaatgtta 


gaaagagccc 


ataattttag 


tgctattaaa 


actcagattc 


aagggaaata 


ttaattcaac 


agcaccatgt 


tagaaatata 


acatttgttt 


cacccagact 


tcaaactcta 


acaataaaga 


ctccaacgga 


aaaaacctgt


aaataatgtg 


ctttcagtgc 


cttagttaca 


ttctgtagta 


tgtaggtgtg 


ttttcttagg 


tggtctctaa 


acactggtca 


ctgtagcagg 


cctccatgct 


gaggtagggg 


gtgccctaaa 


ggtttgctac 


ctgataagct 


gggggatcca 



ca 

<210> 50 

<211> 2675 

<212> DNA 

<213> Homo sapien 

<400> 50 

ggtttatcct aaattactca accccttgta 
tgaaacacaa aaattaatcc ttaataatga 
cttccttttt caatagtaat atttaaaccc 
ttgtcatttg gagtcagtgg aaaatccagc 
aagaaaagtg ttgtcatttt cactctgttg 
ttttttttgt cgtgtaagaa ggatgctggt 
agtctttagc tgttagcaaa cctgttagtt 
tacttactgc tttatgtact ctttgggcat 
4940



gtttatttat ggcatgaacc 


atgtaacttg 


60 


taggtaacaa gttcttacac 


caaacttcct 


120 


tcattaaaaa gaacttgata 


aaagactcaa 


180 


gactcctata aaattcttcc 


tgtttgttaa 


240 


ccagcttcca cttgagtcac 


tttgaaatag 


300 


ttggcagcca agactctgaa 


ctctgcagaa 


360 


gccctgacta tgatgcccct 


gtgtgcattt 


420 


tggtgttaaa ttgtttacat 


ttctttatac 


480 


gctccctttc tgtttcttgt 


tccaaaggga 


540 


taaagtctct tattgctact


gaaagggaaa 


600 


taaacactac tctaacgtgg 


gagaaatgag 


660 


gcctgttatt tatgctgtga 


aaacgaaatg 


720 


tggcctagct gttcctgggt 


gaatgtttcg 


780 






782 


gccttgacaa attttacctt 


aaaaccaaaa 


60 


tagcaagtga tctttctttt 


tagttttagc 


120 


actcttgacc aattgtttgc 


ccaaatattc 


180 


acaccaagca ccagtcttct 


tctgaggcaa 


240 


gagctgcaca ctttttttct 


ttttcttttt 


300 


cagagctgca gaaaatatga 


ggcaattaaa 


360 


ttacttctgc attgaaccag 


cctcagaagc 


420 


taatgccttc tctgtaatta 


tatctcgttt 


480 
ttgcttgaca


gtgacctagc 


cagtaattgc 


atcgtgcatt 


gccatgaaag 


gtaaacacat 


540 


tgtgaactga 


acttaccaag 


cagattctgt 


aagaaagcgc 


tggttgaggc 


tgaacactgt 


600 


tgacacatca 


tttttattgg 


aagagtatta 


actggtgcct 


cttctgaaac 


acaccaaccc 


660 


atattcctct 


gctcccccaa 


agctgtttct 


gatcctgctg 


ggagcaacta 


actagttatt 


720 


atgcacatct 


gctccagacc 


cagctcttta 


acttcacggt 


tttacagctt 


gttttttctt 


780 


tttcttttct 


tttctttttt 


ttttttaaaa 


aagcaccttt 


ttttgttgtt 


tccccttaat 


840 


aaaagaggtt 


tctaatttat 


gtttctctaa 


gatttccttt 


ggttgtattt 


agaaacacaa 


900 


atagttttat 


aatcaggatt 


gcatttgtgc 


tgggtaaaag 


aggagagctg 


tgctttctac 


960 


cccaaagttg 


ttggtaatgg 


agccggtctg 


cccttcctac 


aaactcaaga 


ggctcattgt 


1020 


tcaaggtgta 


acaacattta 


attgcatgtc 


ctctaacgct 


ttgaaacctt 


tagagggaag 


1080 


atcctcattt 


tttacaattt 


tgttcctttt 


catcatagaa 


aacatgttta 


agataaaata 


1140 


caaactttac 


ctaccccaaa 


ttcataaagt 


acatttagtg 


ctgtatagca 


ctaaaactta 


1200 


gagatacaga 


cactgtactt 


actttttaag 


aattagagac 


agtaactcca 


aaaataattg 


1260 


tccttttttc 


tttctttttt 


tttttttaca 


ataaggctct 


tgaaaattgt 


cattacttgt 


1320 


gttttcctat 


acattcatct 


gtgtgaaagc 


ctttttcttc 


tttgatttaa 


aaaaattata 


1380 


cagttaatgg 


tttagaactt 


agaactactt 


agaattaatg 


ctaaagtgtc 


aggaagaaat 


1440 


taatttagct 


tcaataattg 


tgactggcct 


caggaattct 


cccttccaca 


cctgcccacc 


1500 


tcacctcacc 


tcacctcacc 


gcaccgcacc 


gcaccgcacc 


gcaccgcacc 


agagccagag 


1560 


cagctgcttg 


tctgcagcag 


gacacggttc 


ctacatacgt 


ttcagttctt 


tcatggtaag 


1620 


ctcaatggac 


tttgaattgt 


ttacagtgct 


gtatgtccaa 


ttgttaaatg 


taccattctg 


1680 


aacgatgtta 


aagcaagtgt 


ggtttattta 


tggcatgaac 


catgtaactt 


gaaatatgaa 


1740 


cttacaagga 


ggggcactca 


ttaggtaaca 


agttcttaca 


ccaaacttcc 


ttgatgaaat 


1800 


aagccaaaat 


aatcctaaaa 


ttcattagaa 


gaacttgata 


aaagactcaa 


ataaatgtta 


1860 


gaaagagccc 


gtaattttag 


gactcctata 


aaattcttcc 


ttgtttgtta 


atgctattaa 


1920 


aactcagatt 


caagggaaat 


accagcttcc 


acttgagtca 


ctttgaaata 


gttaattcaa 


1980 


cagcaccatg 


ttagaaatat 


attggcagcc 


aagactctga 


actctgcaga 


aacatttgtt 


2040 


tcacccagac 


ttcaaactct 


agccctgact 


atgatgcccc 


tgtgtgcatt 


tacaataaag 


2100 


actccaacgg 


agggagcctg 


ttggtgttaa 


attgtttaca 


tttctttata 


caaataatgt 


2160 


gctttcagtg 


ccttagttac 


agctcccttt 


ctgtttcttg 


ttccaaggat 


tctgtagtat 


2220 
gtagtgtgtt


tcttaggtaa 


agtctctttt 


tgctactgaa 


agggaaatgg 


tctctaaaca 


2280 


ctggtcactg 


tagcaggtaa 


acactactct 


aacgtggaga 


aatgagcttc 


atgctgaggt 


2340 


agtggttgcc 


ttagagctgt 


tatttatgct 


gtagaaaacg 


aaaatggttt 


tgctacctga 


2400 


taagcttcag 


attagatata 


gccctaaagt 


tatcctgtac 


ctgcattaaa 


ttttcatttt 


2460 


aaaaaaaaat 



cttaatttta 


gtaattcatt 


tttttttaag atcattatgt gcaaaacacc 


2520 


aagttttaaa 


aataactcac 


agaatggcct 


tagttttcaa 


tgtctgatgg 


tatatttgtg 


2580 


agttgtgtta 


tctgtaatat 


acatcccaga 


ataaaggagt 


gaattaaata gtttaaaaaa 


2640 


aaaaaaaaaa 


aaaaaagatc 


tttaattaag 


cggtc 






2675 


<210> 51 

<211> 168 

<212> DNA 

<213> Homo sapien 












<400> 51 
acaacacaac 


ctgttaggta 


gcaggctggg 


tattacctga 


gtaaaaagtc 


cctattcttt 


60 


ccatttctcc 


atgctattta 


aaagcatgtg 


gtgatccttc 


aataaattaa 


agatataaca 


120 


aattatttga 


tgccattagg 


cggcctggat 


caccaattct 


aagtcttt 




168 


<210> 52 

<211> 1139 

<212> DNA 

<213> Homo sapien 












<220> 

<221> misc_feature 
<222> (291)..(576)
<223> a, c, g or t 












<400> 52 
caaaattgga 


ttcttcaaaa 


cgccaggttg 


gcctttcttg 


actccaaaag 


aggtgactct 


60 


tgtagcattc 


ttacaggcct 


aacatagcac 


ttactttatg 


ttgcaatgct 


gtgtgcaatg 


120 


ctgtgttttt 


ttactcacta 


gctctgagca 


aaacattatt 


tattcttggt 


aattgtaaca 


180 


cttggtagtg aagacatatt 


ttttggccat 


ataatggatt 


atactttgta 


tttgaatcca 


240 


tgtgaatcag 


tagcatattg 


gtcttcattg 


tttttcctcc 


ttctccgacg 


nnnnnnnnnn 


300 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


360 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


420 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


480 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


540 
nnnnnnnnnn


nnnnnnnnnn


nnnnnnnnnn


nnnnnntaac 


catatttttt 


ccaattatta 


600 


aaaaatcttt 


tttcccccta 


aggaatgtgt 


ttctaaggtc 


accatgtttt 


tagtgcttca 


660 


gtttgacttg 


tctacatcta 


aagttaactc 


tttaaggctt 


acatttatga 


ttataattct 


720 


ttccatcgag 


ggatttagtc 


tccattattg 


tcatgtttct 


tgattttttt 


cattatactt 


780 


agttcttctt 


ttgtctaggt 


tttatatata 


ccttaattaa 


ttctttcttg 


gctgttttat 


840 


atctatgttg 


ctgtcagcat 


gggaatctct 


attattagaa 


agctattaac 


ttttaaaata 


900 


tgtatctttg 


gtgatttctt 


tgtcctaaca 


gtttagatgg 


cctgttgctc 


ttctaaatct 


960 


taaatcatag 


gtattctatt 


ttggttgttg 


tgtttcatta 


cactgtcata 


gtttttgata 


1020 


ttttcattac 


tattgaggta 


ctgtggttta 


ccaatgttct 


gttggtacga 


atactttcca 


1080 


tctgttgggg 


agctcaaact 


gggaaccagc 


acgttcgtga 


tgttcgcacc 


ccccacccc 


1139 


<210> 53 

<211> 681 

<212> DNA 

<213> Homo sapien












<400> 53 
ggtacgggga 


gaacatatca 


aaaaggggaa 


ggatggattc 


ccttgatgcc 


caggattaca 


60 


gtggcaccta 


aagcacattt 


ttttttttct 


gagccaacca 


gctaaaggat 


cactgcagct 


120 


aaatacagat 


agagaagcaa 


caaagccagg 


caaataccca 


tcagagacag 


tgacaagagc 


180 


agctgggggc 


acgggggagg 


cggaaggaag 


agaaagaagg 


ggaggagcct 


ccagagtccc 


240 


agccccaacc 


ccctctgcca 


tgggctaccc 


ttgctcccca 


aaaatcccgt 


tggggttgaa 


300 


gtgaggagga 


ctgcaggctg 


gggtgaaaat 


acacaaggac 


agcccaacaa 


aatacaacaa 


360 


ggactagcat 


cagtctcccc 


cttactccga 


cccccaagaa 


aaataccctt 


attggtgact 


420 


agtatttatg 


aaaatctgta 


aggagactat 


tctatgtagt 


gggctctaat 


cccatataca 


480 


cagcagctgc 


ctgtgtgggg 


aaccttttca 


aatcagtgat 


tgcgggaaca 


aacagtattt 


540 


tcagcttctt 


acggtgccca 


tgcaggcttt 


taccaagacc 


ttggttaagt 


cccagtcaca 


600 


tttactttct 


gtcttcatct 


agaaaagggt 


gaggaaagag 


ggagggacaa 


ggtgctccag 


660 


gtgctagtaa 


gctgggtatc 


c 








681 



<210> 54 

<211> 3191 

<212> DNA 

<213> Homo sapien 



<400> 54 
gtgctgccga


gtagtcccgg 


aagcgaagca 


gcgatggcgg 


agagtccgac 


tgaggaggcg 


60 


gcaacggcgg 


gcgccggggc 


ggcgggcccc 


ggggcgagca 


gcgttgctgg 


tgttgttggc 


120 


gttagcggca 


gcggcggcgg 


gttcgggccg 


cctttcctgc 


cggatgtgtg 


ggcggcggcg 


180 


gcggcagcgg 


gctcccgccc 


tcagccgctg 


cccacggggc 


cgcgctgctt 


agccactggg 


240 


accccacgct 


cagctccgac 


tgggacggcg 


agcgcaccgc 


gccgcagtgt 


ctactccgga 


300 


tcaagcggga 


tatcatgtcc 


atttataagg 


agcctcctcc 


aggaatgttc 


gttgtacctg 


360 


atactgttga 


catgactaag 


attcatgcat 


tgatcacagg 


cccatttgac 


actccttatg 


420 


aagggggttt 


cttcctgttc 


gtgtttcggt 


gtccgcccga 


ctatcccatc 


cacccacctc 


480 


gggtcaaact 


gatgacaacg 


ggcaataaca 


cagtgaggtt 


taaccccaac 


ttctaccgca 


540 


atgggaaagt 


ctgcttgagt 


attctaggta 


catggactgg 


acctgcctgg 


agcccagccc 


600 


agagcatctc 


ctcagtgctc 


atctctatcc 


agtccctgat 


gactgagaac 


ccctatcaca 


660 


atgagcccgg



ctttgaacag


gagagacatc 


caggagacag 


caaaaactat 


aatgaatgta 


720 


tccggcacga 


gaccatcaga


gttgcagtct 


gtgacatgat 


ggaaggaaag 


tgtccctgtc 


780 


ctgaacccct 


acgaggggtg


atggagaagt 


cctttctgga 


gtattacgac 


ttctacgagg 


840 


tggcctgcaa 


agatcgcctg 


caccttcaag 


gccaaactat 


gcaggaccct 


tttggagaga 


900 


agcggggcca



ctttgactac 


cagtccctct 


tgatgcgcct 


gggactgata 


cgtcagaaag 


960 


tgctggagag


gctccataat 


gagaatgcag 


aaatggactc 


tgatagcagt 


tcatctggga 


1020 


cagagacaga 


ccttcatggg 


agcctgaggg 


tttagaccct 


gctcccatct 


ccccttcccc 


1080 


cactcaagag 


tcccagcaga 


atcccttccc 


cccaccccag 


ggatggagag 


gcactgtgta 


1140 


tctccctcca 


gactcgaagt 


catcctgcaa 


gatggcaaga 


accaagcaag 


ctccgatccc 


1200 


agggtgtggg 



agtgggggcc 


tgttcccggt 


ctgacctcct 


tggcactgga 


gcatctgggg 


1260 


cttcgttcat 


ccattcatcc 


cgtatcaggg 


ggccaaggta 


cctttacagg 


agcacctaga 


1320 


gcgagggcct 



ttggcaaaaa 


caaaacaacc 


aacacacctc 


tccacagggc 


cagctcctta 


1380 


gggataagtg



gaagatggaa 


attgcaattc 


caagagggag 



tgtgcccaaa 


tgatttatgg 


1440 


aaatacctcia 





ggggtgggag



ctgtctgtga 


cacttaagca 


gtctgggtgg 


1500 


ttgtctattt 


gtctgtcttc 


agtcttgaag 


cagggcttcc 


caatgccctt 


ttcctccctg 


1560 


ccttccttcc 


cccattattt 


cccacaggcc 


agcataattt 


tgtttttcct 


aatttatagt 


1620 


cactgttcta 


gacagaccaa 


agagaaggaa 


cagtggtgga 


gtctaggctg 


ctgatcagta 


1680 


agctttacct 


agcacctgag 


cacctttctc 


ccctcccctc 


tttcctcacc 


cttttctaga 


1740 


tgtaagacag 


aaagtaaatg 


tgactgggac 


ttaaccaagg 


tcttggtaaa 


gcctgcatgg 


1800 
caccgtaaga


agctgaaaat 


actgtttgtt 


cacaggcagc 


tgctgtgtat 


atgggattag 


tttcataaat 


actagtcaca 


ataagggtat 


gatgctagtc 


cttgttgtat 


tttgttgggc 


gtcctcctca 


cttcaacccc 


agggatttgt 


gttggggctg 


ggactctgga 


ggctcctccc 


cccccagctg 


ctcttgtcac 


tgtctctgat 


tctgtattta 


gctgcagtga 


tcctttagct 


aggtgccctg 


taatcctggg 


catcaaggga 


cccgtacttc 


cagatttatt 


gttatggctc 


gggcctcagt 


cagtgtccag 


ccatgcataa 


tctgctatga 


aggtctctgc 


cttgtggatc 


ggggggctgg 


gcacaaagga 


gaatgtccta 


cagggattgg 


tgggcttggg 


gaacggaagt 


ctcttcacat 


gaagatcgaa 


aagggaccct 


ctactccagg 


gctcagaaga 


atgctcttgg 


ttaagtgttc 


ccactttcaa 


gtgacaatcc 


gtctggcata 


gcagcctgac 


ttttatgccc 


gagtcaaaga 


gatgtcttta 


tatctgactg 


tcccttttgg 


ctgccatcaa 


cgtcgtcttg 


tggttcagca 


cgcattggtc 


ctttgtcacc 


cgattgttgg 


accccttctc 


tgtcgcggcg 


cccgccttgg 


gcccgcccct 


ggggctcagg 


cccgcaccgg 


t 





<210> 55 

<211> 385 

<212> DNA 

<213> Homo sapien 
cccgcaatca


ctgatttgaa 


aagttcccaa 


1860 


agccactaca 


tagaatagtc 


tcttacagat 


1920 


ttttcttggg 


ggtggagtaa 


gggggagact 


1980 


tgtccttgtg 


tattttcacc 


ccagcctgta 


2040 


ggggagcaag 


ggtagccaat 


ggcagagggg 


2100 


cttctttctc 


ttccttctgc 


ctcccccgtg 


2160 


gggtatttgc 


ctggctttgt 


tgcttctcta 


2220 


ggttggctca 


gaaaaaaaaa 


aatgtgcttt 


2280 


atccatcctt 


cccctttttg 


atatgttctc 


2340 


ccagtgggta 


ttggcgattc 


ttgtgatgca 


2400 


gggagaggat 


agtgtgtacc 


tgccctgccc 


2460 


atgggactcc 


ccttggagga 


tctgtgcaaa 


2520 


tttgggaggg 


caggaagcaa 


aggaactgga 


2580 


ttatcttgga 


tacccttgaa 


gaggctgggt 


2640 


gcttccaatt 


tccctcttcc 


attcctcgag 


2700 


tctgtgggtc 


cagtgttgtc 


tgtcatccat 


2760 


tctccttggc 


cctgccatag 


ggcagagcat 


2820 


taatcttgag 


ttgaggaaat 


atatgcacag 


2880 


ttataaatga 


agttttgcgt 


tctgttgggt 


2940 


cacgccagac 


gtgctgggga 


ttggagggct 


3000 


ccctcctagg 


ccgttgtggg 


gcacgacgcc 


3060 


gcgtggcgca 


cgccttctcg 


agggccacgc 


3120 


aggcctcctg 


gtgcgcgtct 


ccgggcgcct 


3180 



3191 



<400> 55 

actgctaacc cctgccaggc ccagctgcca caccctttct gggagaagca tggcctacag 



aatgaagagg gggaccagga acccctgtgg gagaggctta gacctgaagc agtgcccact 



60 
120 
ctggctcctc


ctgccttggc 


tgactgggtt 


cctggaccat 


gtgcatttca 


ctgggccatg 


180 


ggatctacat 


ctccttgcat 


ccccagctgg 


tctgatccct 


gccagggccc 


cttccttcct 


240 


gctcatggtc 


ttcaggtggc 


ctgatcatgg 


aaagtaagga 


gttaggcatt 


accttctggg 


300 


agtgaaccct 


gactccatcc 


ccctattgcc 


accctaacca 


atcatgcaaa 


cttctccctc 


360 


cctggggtaa 


ttcaacagtt 


aaaag 








385 


<210> 56 

<211> 1977 

<212> DNA 

<213> Homo sapien 












<400> 56 
ttcacccggc 


cctcgcctgt 


caccttcaca 


gagaaggggc 


tgcctgcagg 


aagaaagcac 


60 


ggcccacgcc 


cctccagtca 


catactgcct 


gtgggccctg 


gtgtagtgag 


gggggctgcc 


120 


gaggcactgc 


tgcactcacc 


aggcacgtgc 


tggtcggcaa 


acttgatgtt 


gatgatgtag 


180 


ttgcctggct 


ctgtggggca 


gtaggtcacc 


tggcgggaca 


ggtgcctggc 


tgctacaaac 


240 


catgcaatga 


gccatgcccc 


gccctggaca 


cccccgccca 


gcatctgggc 


ctccacgctt 


300 


gggaccgtgg 


gagcggccaa 


cagagctatg 


tctggagaca 


tatgataaac 


cacctcagcc 


360 


cccaccaagc 


cgccgcaccc 


gtagaccaga 


ccccaaggac 


cctggccacc 


atgggccaga 


420 


gagcattacc 


ttcatctctg 


gctctgctga 


gccggccctt 


gagtccccca 


cctgctgcct 


480 


gctctggcga 


ccctgggtgt 


gggagtggtg 


ccgggctgcc 


ttctgcttcc 


gcccgctgcc 


540 


gggattgcct 


ccagcgctgt 


ggaggcctgt 


gtgcggggat 


gcagcccctg 


cctgtctact 


600 


gaggactccc 


actgagggga 


ctgctgaagc 


caactggtgc 


caaggagcac 


aatggagtgc 


660 


cccccagccc 


tgatcgtgca 


ccccccagac 


cggcgggatg 


gccaggcggg 


ctgcaagtca 


720 


accatgggca 


gcagcttcag 


ctaccccgat 


gttaagctca 


aaggcatccc 


tgtgtatccc 


780 


tacccgagag 


gccacctccc 


cagcccctga 


tgcggactcc 


tgctgcaagg 


agccactggc 


840 


cgatccccca 


cccatgcgag 


cacagcctgc 


ccagcacctt 


tgccagtagt 


cctcgtggct 


900 


ccgaggagta 


ctattctttc 


catgagtcgg 


acctggacct 


gccggagatg 


ggcagtggct 


960 


ccatgtcgag 


ccgagaaatt 


gatgtgctca 


tcttcaagaa 


gctgacagag 


gctgttcagc 


1020 


gtacaccaga 


tcgatgagct 


ggccaagtgc 


acatcagaca 


ctgtgttcct 


ggagaagacc 


1080 


agtaagatct 


cggaccttat 


cagcagcatc 


acgcaggact 


accacctgga 


tgagcaggat 


1140 


gctgagggcc 


gcctggtacg 


cggcatcatt 


cgcattagta 


cccgaaagag 


ccgtgctcgc 


1200 


ccacagacct 


cggagggtcg 


ttcaactcgg 


gctgctgccc 


caaccgctgc 


tgcccctgac 


1260 



WO 02/068633 PCT/US01/43612 




agtggccatg 


agaccatggt 


gggctcaggt 


ctcagccagg 


atgagctgac 


agtgcagatc 


1320 


tcccaggaga 


cgactgcaga 


tgccatcgcc 


cggaagctga 


ggccttatgg 


agctccaggg 


1380 


tacccagcaa 


gccatgactc 


atccttccag 


ggcaccgaca 


cagactcgtc 


gggggcaccc 


1440 


ttgctccagg 


tgtactgcta 


acccctgcca 


ggcccagctg 


ccacaccctt 


tctgggagaa 


1500 


gcatggccta 


cagaatgaag 


agggggacca 


ggaacccctg 


tgggagaggc 


ttagacctga 


1560 


agcagtgccc 


actctggctc 


ctcctgcctt 


ggctgactgg 


gttcctggac 


catgtgcatt 


1620 


tcactgggcc 


atgggatcta 


catctccttg 


catccccagc 


tggtctgatc 


cctgccaggg 


1680 


ccccttcctt 


cctgctcatg 


gtcttcaggt 


ggcctgatca 


tggaaagtaa 


ggagttaggc 


1740 


attaccttct 


gggagtgaac 


cctgactcca 


tccccctatt 


gccaccctaa 


ccaatcatgc 


1800 


aaacttctcc 


ctccctgggg 


taattcaaca 


gttaaaagaa 


gcttatctta 


aatgtattgt 


1860 


attggggggt 


gggcagggcc 


cactctatgt 


tatgttaagg 


agttggttct 


ggttcttggc 


1920 


tgatgttctg 


tatcttaaca 




L- Uctciy I. ell* 


uuy l. ^- *—y ^y 


a rra fete* 


1977 


<210> 57 

<211> 629 

<212> DNA 

<213> Homo sapien 












<400> 57 
tgggtgatct 


agtctgtcga 


gcgcgcatgt 


gatggatagc 


gtggtcgcgg 


gcccggaggt 


60 


tacatggtgg 


cctatacaaa 


agcactccat 


gtttctgcct 


gaataacgta 


cgttgatcgc 


120 


ttggttgcct 


gagcaaatgt 


ataagggagc 


tgcttggagg 


ggaaaggaac 


acaataaaac 


180 


gcctctggag 


gtatttcaaa 


gagtagtttc 


tcaaatctct 


ctgattcagg 


aagaggatga 


240 


cgagagggag 


agaacgtgga 


attacttaaa 


gtcatctaac 


tcattggtct 


tgttcaacaa 


300 


gaaagaattc 


tggtttgtgg 


ctgaatctga 


tttaacagct 


gccaatagct 


cattacttct 


360 


aagatgtata 


tctaattcaa 


aactagatgc 


tcccccctcc 


ctcttttttc 


cctaaaagct 


420 


gggaacttgt 


aagtaaatgg 


cctagtgaaa 


tagacactgg 


ctattttaga 


aaaagggtca 


480 


gaaggtagac 


caggaactat 


gtatcaagaa 


gaagtaagtg 


aggctcaagc 


ccaggccacc 


540 


taggctgtgt 


ttattcagtc 


cacccaatcc 


aatgtcaaag 


ccccaagtgg 


gccagtgtta 


600 


cctttatctt 


cccatctttg 


tatttctga 








629 



<210> 58 

<211> 3535 

<212> DNA 

<213> Homo sapien 
gtgataggca


gctttccttc 


ttttcaacag 


gctgaggttt 


tgtgctcact 


gaaagggctg 


ggtatgtgaa 


gatgcaccgt 


cttttcaaat 


ggggcctgcc 


cttctctgct 


gtgtcctttc 


cccactgggg 


catataaaat 


ctctgctgta 


accaggaccc 


actggggcat 


ataaaaaagt 


gttacttagg 


aaacagactt 


cgcatttcaa 


tctggcttct 


agacaaattc 


ctgatagaac 


cccatcccct 


gtttagggat 


agagttgata 


cctgaatttt 


tttaattgac 


ttttgagctt 


tgctgttgtg 


gataggaaag 


acttaaccta 


aatctaacaa 


tatgaagggc 


tcttatgagt 


acgcacagga 


acgaaatacc 


tcccagaaac 


tatttttaaa 


aagtatacag 


atcaaagcaa 


gcaaatattt 


tttaaggcag 


tattaagtgc 


cacatggcta 


ctgggaatat 


aaatttcgct 


tggcaaaacc 


ttaagattgt 


gtactggggc 


gaggaaatta 


tccgagatcc 


ccacaaactg 


catacacaag 


aaaaacagag 


aaaagcctga 


ttacggtgtg 


tctgcatgag 


gcttttatga 


cagtggctca 


tgcctgtaat 


cccagaactt 


gtcaggagtt 


tgagactagc 


ctggccaata 


aaaattagct 


gggcgtggtg 


gcaggcgcct 


ggagaatcgc 


ttgaacccgg 


gaggcggagg 


tccagcctgg 


gtgacaagag 


tgagactccg 


aacaaaaata 


aagtctatgc 


ccattaagac 


ctacttaaaa 


aaatatttaa 


gcttgatgtt 


gaaaattaaa 


gtatgtgaca 


agatcaggtc 


tgaatagcaa 


aaaagtaaca 


ttacacagta 
tgatacctac


gaaaatcaaa 


ataaatgcaa 


60 


tcaaccccag 


aaggccgaca 


caaaaaaaat 


120 


ggcctgggag


agtcaaatgg 


cctgggagag


180 


ggcttcccag 


ttgagctccc 


aagaccagga 


240 


tcctttcggc 


ttcccagttg 


agctcccaag 


300 


caaaaatcaa 


aatcaaacaa 


caagttctga 


360 


tcagagaggc


cacagagcaa 


ggtctaaact 


420 


atttaaatgt 


gggaagtggc


ttccccaggt 


480 


tcatttttat 


aggtgccatg


tatgcctctg 


540 


ttaagattgc


acgagggaga


acaaggcctt 


600 


aaattaaacc 


agcaagaaag 


cattagtaaa 


660 


catttttttc 


aaaagatgaa 


aactccagaa 


720 


atgaagcaat 


catcgaagac 


tcactggtaa 


780 


aaagaagcca 


tgtgtaacaa 


agagaaatgt 


840 


aaaaggagta


acatgaaata 


aacattcttt 


900 


ccagaaaggc 


cgtagcagtt 


tgacgatagg


960 


ccagaatttt 


tatttctagg 


aatgtatcct 


1020 


caatgtttag 


gaattgtcct 


tatagcattg 


1080 


tccctgtcag 


tggaaaaggg


gttcaatgaa 


1140 


cattaaaaat 


tgttgaacaa 


cggcctggca



1200 


^333 *■* 3 3 w 


aggtgggcgg


ataacttgag 


1260 


tggtgaaacc


ccgtctctac 


taaaaataca 


1320 


ataatcccaa 


ctgctcagga


aactaaaaca 



1380 


ttccagtaag 


ccaagatcgt 


gccactgcac 


1440 


tctcaaaata 


aaaaaaaaaa 


aaaaaagtaa 


1500 


gtcttctaat 


tcagttgtga 


ttgtctgctc 


1560 


taattattcc 


ctttcagcaa 


atttggatca 


1620 


accttgaatt 


tccacacaat 


ctcaagacac 


1680 


atgattagga 


tatttcctta 


gactttgctg 


1740 
gatctttggt


cttaaggtaa 


catgtaaaag 


tagtgaagcc 


tttcctttca 


tggccctgtg 


1800 


caatgtaacg 


gttttctgcc 


tcctcttcag 


ctggaagcgt 


tagtggtagt 


atgggcacag 


1860


aatatatgta 


cactggcgat 


gctgaccatg 


cctcccaggt 


accctggctc 


tgggttcctt 


1920 


gacctaggga 


acaagattgg 


atgaggcaga 


tctttgagcc 


catgtgacta 


tagaatttgc 


1980 


tgatgatata 


attttacaat 


aacaatggat 


aggaatttta 


cctctctttt 


tattagttta 


2040 


atattattta 


atattatgta 


cataagtgtt 


cactcgccta 


attaaaaaca 


ttgagtaaac 


2100 


caagttttta 


tatagactac 


ccttgccata 


tgatgctctt 


tttctctaat 


aatatgcagt 


2160 


ttaaatcctg 


aggaatcaat 


gcccagcatt 


tcaccacatc 


tgaactctgt 


gtgggcattc 


2220 


ttcactcgcc 


tacaaggggt 


aaacaaggct 


accagaactt 


gaatttgact 


tatagggagc 


2280 


tacccaggaa 


ggggaaagcc 


cttgggactt 


tttccaaaac 


aatcttctat 


ttgaactgtt 


2340 


catcagccaa 


agtagtccac 


tgaggtgaca 


aagctttcag 


aaatacaaag 


atgggaagat 


2400 


aaaggtaaca 


ctggcccact 


tggggctttg 


acattggatt 


gggtggactg 


aataaacaca 


2460 


gcctaggtgg 


cctgggcttg 


agcctcactt 


acttctcctt 


gatacatagt 


tcctggtcta 


2520 


ccttctgacc 


ctttttctaa 


aatagccagt 


gtctatttca 


ctaggccatt 


tacttacaag 


2580 


ttcccagctt 


ttagggaaaa 


aagagggagg 


ggggagcatc 


tagttttgaa 


ttagatatac 


2640 


atcttagaag 


taatgagcta 


ttggcagctg 


ttaaatcaga 


ttcagccaca 


aaccagaatt 


2700 


ctttcttgtt 


gaacaagacc 


aatgagttag 


atgactttaa 


taattccact 


tttctctccc 


2760 


tctcttctcc 


tcttcctgaa 


atcagagaga 


tgagaaacta 


ctctttgaaa 


tacctccaga 


2820 


ggcgttttat 


tgtgttcctt 


tcccctccaa 


gcagctccct 


ttatacaatt 


ttgctcaggc 


2880 


aaccaaggac 


agagtatcgg 


cagaaacatg 


gagtgctttt 


gtataggcca 


cctgtacata 


2940 


aaagtgtaat 


tatttattta 


attttcccat 


ttgtatcata 


ttaaagcttt 


gtacagtgtt 


3000 


ttaagttctg 


ttttaaaatt 


attttgtatt 


ttatttttat 


aacctagtaa 


taaaatattc 


3060 


attccgcatg 


caaaaaaaaa 


cacacacaca 


acccaaacaa 


ccaaaaacaa 


acagaaccaa 


3120 


aagatagaaa 


ccaaacacag 


caacagacag 


acacaagaga 


agacgaaaac 


aaccacacaa 


3180 


acagcacaca 


caaccgcagc 


ggagaaccaa 


caaagccaaa 


cgcaagacag 


cacaacaagg 


3240 


gacaaaacac 


acacgtagca 


caaagcagcc 


gcagaaccga 


acacacatct 


aaagacaggc 


3300 


gacaacaaca 


tagcaggccg 


tagcaccgac 


aaccactaaa 


ccataactat 


ccagcggaga 


3360 


tcagatcaca 


gaagccgaca 


cagaaagaaa 


agcatgtatg 


caacgcatga 


caccgcaggg 


3420 


cggataaaaa 


cactcgtagg 


cgggaccgcc 


gcagaacgat 


cgaaggctaa 


caaacaagct 


3480 




agtagatcac aagtacgtga ccacgccctc 

<210> 59 

<211> 348 

<212> DNA 

<213> Homo sapien 

<400> 59 



cgaatgatgc 


tatatagggc 


catgggtgca 


atggatcgcg gcgaggtact 


tctgccatca 


agataatcca 


tctctaaaat 


tcaagacttc 


ttggggttta 


taaccaatcc 


gatcattgat 


aagcccaatg gctccttctc 


catcactgcc 


cttacgggtc 


tccaggtcct 


ttcgaatgtt 


<210> 60 
<211> 2497 
<212> DNA 
<213> Homo sapien 




<400> 60 
ttatacccta 


tgctcgacca 


cttggagtgt 


aaaccacgct 


tgctcgtccg 


tgtttcttgg 


cgtggtctat 


acggaagggt 


ctaaaacttt 


tctttgtgtg 


tttttgtttg 


gatgcctcac 


cagaaaatca 


gatcagtttt 


atagagtcaa 


ccagcttaga 


gtgcaagtga 


agcatcagta 


ggtacattct 


gcttgactgc 


aagcactcaa 


cattttattc 


cattccagcg 


ctctggcatg 


tccaaattga 


gatgaacgat 


tattgggctt 


gatcatttgt 


tcccggctct 


tgggatcact 


gcctcctaca 


acttccacat 


cttccttctg 


gttctcaaac 


tcttcaatgt 


catcaagctt 


catactgaag 


tcaacactaa 


gggccaaacc 


cctccgcgtc 


caaacagatc 


ttaaatgtgt 


ttccagctac 


agagttaaaa 


ggtacagtca 


cccggaacct 


cggctccccc 


ccaacgaaac 
ttatctataa


aggaggaaga 


gggag 


3535 


ctgatgcatg 


gctcgagcgg 


ccgcagtgtg 


60 


ttttattcca 


ttccagcgct 


ctggcatgca 


120 


caaattgaga 


tgaacgatta 


tgggcttggg 


180 


catttgttcc 


cggctcttgg 


gatcactgct 


240 


tcctacaact 


tccacatctt 


ccttctgttt 


300 


ctcaaactct 


tcaatgtc 




348 


tcaacctcgt 


tcaccggggt 


tttaatggtg 


60 


aacgcttgcc 


ctgtaccata 


cctatgcgcc 


120 


tctacggcta 


atgcggccca 


caaagttttt 


180 


agtggtccaa 


aatatattta 


tttttaaaaa 


240 


attttcaaga 


gacaaaccag 


agtttggatt 


300 


atctcatgca 


gaagatttgt 


ctctgcagga 


360 


ttctctaaat 


ctggtttgta 


cttctgccat 


420 


caagataatc 


catctctaaa 


attcaagact 


480 


gggttggggt 


ttataaccaa 


tccgatcatt 


540 


gctaagccca 


atggctcctt 


ctccatcact 


600 


tttcttacgg 


gtctccaggt 


cctttcgaat 


660 


tagctctagg 


cgtgttggtt 


tccgtctcag 


720 


cagtgaacta 


ttagcaattc 


acaggattat 


780 


acgtcattca 


ttaaacaaac 


acttcttgtg 


840 


taatctcttt 


tcaagccggc 


ctagcccctt 


900 


tactgctaag 


ccaactggac 


tacacttccc 


960 
agactgcttg


gagcctctct 


ctccgcagaa 


cctcgtcttc 


cgcgagcttt 


tcctggaggt 


1020 


tctaggaggg 


atgcccctca 


atgccacgac 


gccatttcct 


actacgactt 


ccatcatgct 


1080 


ccgcgccgct 


ccgggggcgg 


tgacccctct 


tggcccacgt 


cttgtcagtg 


acgcacttcc 


1140 


tgcaaccaaa 


actaaagcac 


ccgacgactt 


agttgctccg 


gtcgtgaaga 


aaccacacat 


1200 


ctattatgga 


agtttggaag 


agaaggagag 


ggagcgtctg 


gccaaaggag 


agtctgggat 


1260 


tttggggaaa 


gacggactta 


aagcagggat 


cgaagctgga 


aatattaata 


taacctctgg 


1320 


agaagtgttt 


gaaattgaag 


agcatatcag 


cgagcgacag 


gcagaagtat 


tggctgagtt 


1380 


tgagagaagg 


aagcgagccc 


ggcagatcaa 


tgtttccaca 


gatgactcag 


aggtcaaagc 


1440 


ttgccttaga 


gccttggggg 


aacccatcac 


actttttgga 


gagggtcctg 


ctgaaagaag 


1500 


agaaaggtta 


agaaatatcc 


tctcagttgt 


cggtactgat 


gccttgaaaa 


agaccaaaaa 


1560 


ggatgatgag 


aagtctaaaa 


agtccaaaga 


agagtatcag 


caaacctggt 


atcatgaagg 


1620 


accaaatagc 


ttgaaggtgg 


caagactatg 


gattgctaat 


tattcgttgc 


ccagggcaat 


1680 


gaaacgcttg 


gaagaggccc 


gactccataa 


ggagattcct 


gagacaacaa 


ggacctccca 


1740 


gatgcaagag 


ctgcacaagt 


ctctccggtc 


tttgaataat 


ttttgcagtc 


agattgggga 


1800 


tgatcggcct 


atctcctact 


gtcactttag 


tcccaattcc 


aagatgctgg 


ccacagcttg 


1860 


ttgtgatgaa 


ccagtggcag 


atattgaagg 


ccatacagtg 


cgtgtggcgc 


gggtaatgtg 


1920 


gcatccttca 


ggacgtttcc 


tgggcaccac 


ctgctatgac 


cgttcatggc 


gcttatggga 


1980 


tttggaggct 


caagaggaga 


tcctgcatca 


ggaaggccat 


agcatgggtg 


tgtatgacat 


2040 


tgccttccat 


caagatggct 


ctttggctgg 


cactggggga 


ctggatgcat 


ttggtcgagt 


2100 


ttgggaccta 


cgcacaggac 


gttgtatcat 


gttcttagaa 


ggccacctga 


aagaaatcta 


2160 


tggaataaat 


ttctccccca 


atggctatca 


cattgcaacc 


ggcagtggtg 


acaacacctg 


2220 


caaagtgtgg 


gacctccgac 


agcggcgttg 


cgtctacacc 


atccctgctc 


atcagaactt 


2280 


agtgactggt 


gtcaagtttg 


agcctatcca 


tgggaacttc 


ttgcttactg 


gtgcctatga 


2340 


taacacagcc 


aagatctgga 


cgcacccagg 


ctggtccccg 


ctgaagactc 


tggctggcca 


2400 


cgaaggcaaa 


gtgatgggcc 


tagatatttc 


ttccgatggg 


cagctcatag 


ccacttgctc 


2460 


atatgacagg 


accttcaagc 


tgtggatggc 


tgaatag 






2497 



<210> 61 

<211> 604 

<212> DNA 

<213> Homo sapien 
<400> 61
catctgatca 


cgtcgcccgc 


cctagtctgc 


ttttgtgaat 


ctccactttg 


ttcaaccccc 


60 


acccgccgtc 


tctccctccc 


tgcgccttcc 


ctctaacctt 


aaagacccag 


ctacctctac 


120 


gcaaatggta gctacctctg 




agttgagttc 


ggttcacgcg



ggaccctctt 


180 


ccctgtggca 


agctgctgaa 


gaaaacctgc


tgttactgct 


ttagctgctg 


ttctgcttaa 


240 


tctcagacag 


ctggaaggta 


gtattccgcc 


ctcgcccagc 


cacccgagta 


aatgagaatg 


300 


atgactgact 


gagaccctga 


L 33 *-3 3 u 33 v 


ttaggcttaa 


tctcagacag 


ctggaaggta 


360 


gtattccgcc 


ctcacccagc 


cacccgagta 


aatgagaatg 


atgactgact 


gagaccctga 


420 


tggtggtggc 


ttaggcaaga 


tgaacgtctg 


aaatggaggc 


ccagggctgc 


cacgatggct 


480 


ctgtagtcat 


cagggaggga 


gctcctttta 


tcttgttacc 


tactcccctc 


ttgtgcccct 


540 


ttcttccctt 


gatctaagaa 


gattactccg



ggatctcgcc 


atctggcctg 


cgttcaatca 


600 


gcag 












604 


<210> 62 
<211> 4733 
<212> DNA 
<213> Homo sapien 












<400> 62 
atggagagaa 


accagaacag 


aaaagtggaa 


aattccaaaa 


accagagcgt


ctcttctcct 


60 


ccaaaggatc 


acagctcctc 


accagcaagg


gaacaaaact 


ggatggagaa


tgagtttgat 


120 


gaattgacag 


aagtaggctt 


cagaaggtca


gtaataacaa 


actcctccga 


gctaaaggag 


180 


catgctctaa 


cccatcacaa 


ggaagctaaa


aaccttgaaa 


aaagtgttaa 


cactcctgtg 


240 


gacgttttgt 


ctttgttttt 


caccccagtc 


cacacctctg 


ctgttagtga 


ggggctcctg 


300 


gtcatcacag 


ccacatgtgg 


actcccctgc 


tctgtacctc 


ccagcctttc 


tgccaacagc 


360 


actggcctca 


cagtgctgtc 


Ctcrqqcrattcf 


agtgagatag 


ctataatgtg 


gagtcgctgc


420 


tgcctgctaa 


ctcgcccaga 


cgttcttcga 


gtgtcacatt 


tgaagacgag


cactgaggat 


480 


gaggaaccaa 


ctgaagaata 


taaaaatott 


agaaatgcag


catctaagtg


occaaaaatcr 


540 


gaggatccta 


tccctgaatc 




aacacatcta 


tttgggatag


caaggtagag


600 


aatcaacaga 


aaaagcctgt 


ggaaaacagg 


atgaaggagg 


acaaaagcag 


catcagggaa 


660 


gcaatcagca 


aagccaagag 


tacagcaaat 


ataaagacag 


aacaggaagg 


tgaggcatct 


720 


gagaagagct 


tgcatctgag 


cccacagcat 


atcacacacc 


agactatgcc


tataggacag 


780 


agaggcagtg 


agcaaggcaa 


acgtgtggag 


aacattaatg 


gaacctccta 


ccctagtcta 


840 


cagcagaaaa 


ccaatgctgt 


taagaaatta 


cataaatgtg 


atgaatgtgg 


gaaatccttc 


900 
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aaatataatt 


cccgccttgt 


tcaacataaa 


attatgcaca 


ctggggaaaa 


gcgctatgaa 


960 


tgtgatgact 


gtggagggac 


tttccggagc 


agctcgagcc 


ttcgggtcca 


caaacggatc 


1020 


cacactgggg 


agaagccgta 


caagtgtgag 


gaatgtggga 


aagcctacat 


gtcctactcc 


1080 


agccttataa 


accacaaaag 


cacccattct 


ggggagaaga 


actgtaaatg 


tgatgaatgt 


1140 


ggaaaatcct 


tcaattatag 


ctctgttctg 


gaccagcata 


aaaggatcca 


cactggggag 


1200 


aagccctatg 


aatgtggtga 


gtgtgggaag 


gccttcagga 


acagctctgg 


gctcagagtc 


1260 


cacaaaagga 


tccacacggg 


ggagaagccc 


tatgaatgcg 


acatctgtgg 


gaaaaccttc 


1320 


agtaacagct 


ctggccttag 


ggtccataaa 


aggatccaca 


caggtgagaa 


accttacgaa 


1380 


tgtgatgagt 


gtgggaaggc 


cttcattact 


tgtagaacac 


ttctcaacca 


taaaagcatc 


1440 


cactttggag 


ataaacccta 


taaatgtgat 


gagtgtgaga 


aatcttttaa 


ttatagctct 


1500 


cttctcattc 


agcataaagt 


catccacact 


ggagagaaac 


cttatgaatg 


tgatgaatgt 


1560 


gggaaggctt 


tcaggaacag 


ctcaggcctc 


atagtgcata 


aaaggatcca 


cacaggagag 


1620 


aaaccttaca 


agtgtgatgt 


ctgtggcaaa 


gcattcagct 


atagctcagg 


cctcgcagtc 


1680 


cataaaagca 


ttcaccctgg 


gaagaaagcc 


catgaatgta 


aggagtgtgg 


gaaatccttt 


1740 


agttataact 


cactacttct 


tcaacacaga 


actattcata 


ccggagagag 


accttatgta 


1800 


tgtgatgtgt 


gtgggaaaac 


gttcagaaac 


aatgcaggcc 


tcaaagtcca 


caggaggctc 


1860 


catactgggg 


aaaaaccata 


taagtgtgat 


gtgtgtggga 


aagcctatat 


ctcacgctct 


1920 


agccttaaaa 


atcacaaagg 


aatccacctt 


ggggagaagc 


cctataaatg 


tagctattgt 


1980 


gagaaatcct 


tcaactacag 


ctctgccctt 


gaacagcata 


aaaggattca 


taccagggaa 


2040 


aaaccctttg 


ggtgtgatga 


gtgtggtaaa 


gctttcagaa 


ataattctgg 


ccttaaagta 


2100 


cataaacgaa 


tccacactgg 


ggaacgacct 


tacaaatgtg 


aagaatgtgg 


gaaagcatac 


2160 


atctctctct 


cgagccttat 


aaatcataaa 


agtgtacacc 


ctggggagaa 


gccctttaag 


2220 


tgtgacgagt 


gtgagaaggc 


cttcatcaca 


taccgaaccc 


ttacaaacca 


caaaaaagtt 


2280 


catcttgggg 


agaagcccta 


caaatgtgat 


gtgtgtgaga 


aatcttttaa 


ttacacatcg 


2340 


ctcctttctc 


agcacagaag 


ggtccacact 


agagagaaac 


cctatgaatg 


tgacaggtgt 


2400 


gagaaggtct 


tcagaaacaa 


ctcaagcctt 


aaagttcata 


aaagaatcca 


tactggggag 


2460 


aggccctatg 


aatgtgatgt 


gtgtggaaaa 


gcctacatct 


cacactcaag 


ccttattaac 


2520 


cataagagta 


cccaccctgg 


caagacaccc 


catacatgtg 


atgaatgtgg 


aaaagctttt 


2580 


ttctcaagca 


gaactcttat 


aagccataaa 


agagtccatc 


ttggggagaa 


acccttcaag 


2640 
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tgtgttgagt 

gtgggaaatc tttcagttac agctctctcc tttctcagca caagaggatc 2700
cacacagggg agaaacccta tgtgtgtgat aggtgtggga aggccttcag gaacagctca 2760
ggcctcacag tgcataaaag gatccacaca ggtgagaaac cctatgaatg tgatgagtgt 2820
gggaaggcat acatctcaca ctcaagtctt atcaatcata aaagtgtcca ccaggggaag 2880
cagccctata attgtgagtg tgggaaatcc ttcaattata gatcagtcct tgaccagcac 2940
aaaaggatcc acactggaaa gaagccatac cgatgtaatg agtgtgcaca tatacccaac 3000
gccaccgcgg acctcatgaa agtggaccat gaagaggagc cccagctctc cgagccctac 3060
ctttctaaac aaaagaagct catggccaag atcttggagc atgatgatgt gagctacctg 3120
aagaagatcc tcggggaact ggccatggtg ctggaccaga ttgaggcgga gctggagaag 3180
aggaagctgg agaacgaggc actttcccag tggaaagaat ttgatgccat ttccgcacac 3240
ccaaattcct gttggcagga taatcagcaa gtaatacaga gagctgcttc cttcaacagc 3300
tgcattcagc acaaaaccga gggccgatta tgtggagaaa tatccttcag agaagcatgt 3360
gatctttgtt ctgccttttg ccctttgaag catatgatct ttgttcctac tccctgttcg 3420
tacactccct ccccttttga aatccttaat aaaaacctgc tggttttacg gctcaggaag 3480
cttctgctga ttgaacgcag gccagatggc gagatccgga gcaaccttct tagatcaagg 3540
gaagaaaggg gcacaagagg ggagtaggta acaagataaa aggagctccc tccctgatga 3600
ctacagagcc atcgtggcag ccctgggcct ccatttcaga cgttcatctt gcctaagcca 3660
ccaccatcag ggtctcagtc agtcatcatt ctcatttact cgggtggctg ggtgagggcg 3720
gaatactacc ttccagctgt ctgagattaa gcctaagcca ccaccatcag ggtctcagtc 3780
agtcatcatt ctcatttact cgggtggctg ggcgagggcg gaatactacc ttccagctgt 3840
ctgagattaa gcagaacagc agctaaagca gtaacagcag gtctccttca gcagcttgcc 3900
acagggaaga gggtcccgcg tgaaccgaac tcaacttcca cctgcgcaga ggtagctacc 3960
atttgcgtag aggtagctgg gtctttaagg ttagagggaa ggcgcaggga gggagagacg 4020
gcgggtgggg gttgaacaaa gtggagattc acaaaagcag actagggcgg gcgacgtgat 4080
cagatgacct gtgcgggcgg cagcctcctg ccctcctccc cttcgtgcgc cggctggagc 4140
gaagagttct tttgacagcc gtgagcttcc ccgccaggaa cttactgggg ctgcatcacc 4200
ctagaaacgt ggctttgggc tgtggaaacg ctgcctctgt ggaagtctct cctcgcgggg 4260
gtggacgggt cgctgcgcgc ccagcgttct tctgcggttc tcacagcccg cccgccgccg 4320
ccgcctcggg gaccctttgc ggggaggcct caggtcagcg ccgccccttg cgatggcggg 4380
agagcagagc ggccccaggg cctctgagct ccggccccgg cacgtcccgc cctctcctcc 4440
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gcttgcggga 


gccggggcgc 


ctcggaggga 


aaccttcccg 


gacacaggcc 


gggagaagag 


4500 


gggcccgtgg 


cgccatcggc 


gcagcgttcc 


tggtttcggc 


catgtttttt 


tttttttttt 


4560 


tgagacggag 


tttcgctctt 


tttgcccagg 


ctggagggca 


atggcgccgt 


ctcggctccc 


4620 


tgcagccgcc 


gcctcccggg 


ttgaagggac 


tctccggcct 


caggccgcgg 


tgttcttgta 


4680 


aattagaacc 


ggcctatgcg 


ccaagcgcgg 


gtctcgcggc 


tgcggagaga 


ggc 


4733 


<210> 63 

<211> 577 

<212> DNA 

<213> Homo sapien 












<400> 63 
aaaaacaaac 


aaaaaaacca 


gttatttact 


tggctataac 


aatgtctgta 


aatggtaaag 


60 


acagaaaaca 


acctaagttg 


ttcattgggg 


gactagataa 


atttaaattg 


gttacattcc 


120 


atacaatgtg 


gaagataaag 


tagctgtgag 


gaacgatgcg 


cctcctccct 


ttacagatgt 


180 


ggaatgaccg 


ccaagataca 


gccaggaaaa 


aagcaagaag 


agaacagttg 


gtgatctgct 


240 


gtttcagttc 


aaaataacca 


attttaaaaa 


aatgtctcta 


gaaggctaca 


taagaggctg 


300 


gtgagacagt 


ggtcctgtgg 


gagggaatgt 


gggtaacagg 


caacaggatg 


ggagggatgg 


360 


tcaccataca 


tcttttggca 


cactctgcat 


tttgaaacat 


atgattggtt 


ttccctagtc 


420 


taacaaaata 


actttaggtc 


caactgactg 


tagtaaattg 


gtcactagca 


aaaaaaaaaa 


480 


aaaaaacaaa 


aaaagcgtgg 


ggggaacccg 


gggccaacgc 


gggtccccgg 


tggggaatgg 


540 


gtttcccggc 


tccaatttcc 


ccccattttg 


cgcacaa 






577 


<210> 64 

<211> 744 

<212> DNA 

<213> Homo sapien 












<400> 64 
aaaaacaaac 


aaaaaaacca 


gttatttact 


tggctataac 


aatgtctgta 


aatggtaaag 


60 


acagaaaaca 


acctaagttg 


ttcattgggg 


gactagataa 


atttaaattg 


gttacattcc 


120 


atacaatgtg gaagataaag 


tagctgtgag 


gaacgatgcg 


cctcctccct 


tacagatgtg 


180 


gaatgaccgc 


caagatacac 


agccaggaaa 


aaagcaagaa 


gagaacagtt 


tgtgatctgc 


240 


tgtttcagtt 


caaaataaca 


atttaaaaaa 


atgtctctag 


aaggctacat 


aagaggctgg 


300 


tgagacagtt 


gctctgtggg 


agggaaagtg 


ggtaacaggg 


aacaggatgg 


gagggattgt 


360 


caccatacat 


ctttttgcac 


actctgcatt 


ttgaaacata 


tgattgtatt 


tcctagtcta 


420 
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acaaaataac attaggtcca actgactgta 
gtaaattgtc actagaatct ttgtcttgga 480
gtcatttgat tttatttttc acagaagggc atatggttag atagcctaaa acagcagggg 540
agggagcagt gtgtgctagg agctgccaca tgccaggcac ctgatagata ccagctctat 600
ctcacgctcc ccctgtccct tgaggtaggt gtttctgttt ctacttacaa gaaaatagac 660
taaggcgtaa agtaacacaa tcacggtcac acatgtggca agcagcacag cctggatttg 720
aagctaggca agtctgactc taag 744

<210> 65

<211> 318

<212> DNA

<213> Homo sapien

<400> 65

taagtgtttt aaagaaacta gagcagcaga gaagagagac tcagaagccc gtggcacggc 60
tttctcggcg tcccctccag cgagggggtc tccattgctg cagttgccgg ttttgtccaa 120
ccaggtcagg aggctgcccg gccccctccc cactctcaaa gttgcttgtt aaacacagag 180
tcgtaatttg tggctaaata taactagtgt gttctcacgg aaagtataat tcagggtgct 240
cattgtatga ggttatcaac aagacccatc tgggttaaat taaagtgatt ttcataaagg 300
gcagaagcgc ccctttac 318

<210> 66

<211> 2505

<212> DNA

<213> Homo sapien

<400> 66

ctgcgccccc aataatcgtg tgttgggtat ctctccattg ttccctgggc cattagtcaa 60
agggcaaatt agaaacaatt tcttgacgac aaaaattgac tcttaattct tcccaccaga 120
ggcgcagggg gacaagtcca gaggccaggc ctgaggctgg ggctggcggc aggaggggta 180
ggcggggtag aggtggggta gaggtggggg atctaagatg gttcacagca cagagacctc 240
cctcaactgg agggcagcag cagctcaccc ccaccccaga atcagtgttc cagaggacag 300
ctgggggcag ggggttagga acaatcccac tccctcctgg aagaggtcct gaccccccac 360
cccacccatc tgcctgtacg ggtccctctg ctatcgactg gggcgagttc ttttcatgag 420
ggccttgcct ggtgcctgga gggaacaggg tgtgggggaa aggtctgtgt gttcccccct 480
cacctcccct gctcagcggt gtggcctctg gctctgggga ggatcgggaa gccccagggt 540
ccgtgccttg gagtgggggg acagcctttt ctctgccaca gcttctgcct ttgggggctt 600
acccttccta ggagcagaac tgttgtggag gggaagagga cggtgaaggt tccgaaggga 660



WO 02/068633 PCT/US01/43612




agggggctgc 


ccccactgaa 


aacgaagctt 


ccagtcacag 


ccccttcatt 


atttatcagg 


720 


acccagggga 


tgaggtggca 


ggggaggggg 


ctgcatggag 


ggagtgccct 


caccctcgtc 


780 


cccagcgcct 


gccccctccc 


gaccaggcct 


gggctgaggc 


ccagggtaag


ggggctgagg 


840 


caggccacag 


aggagcaaga 


cttgtcaggg 


gccagacctg 


ggtaggagga 


ttgtcctcca 


900 


ggcacacacg 


gcccccagcc 


ccccagcctg 


tcgaactggg 


ctctcccaga 


aggtccccgg 


960 


ctccagccca 


agcagggagc 


caggttgggg 


gtgtgggaag 


gcagaagtcc 


cagggatcct 


1020 


gggggaggct 


caggttgtac 


ctgcaggcca 


cagtctcctc 


gacagacctc 


ggacgaggtt 


1080 


gtaagtgttt 


taaagaaact 


agagcagcag 


agaagagaga 


ctcagaagcc 


cgtggcacgg 


1140 


ctttctcggc 


gtcccctcca 


gcgagggggt 


ctccattgct 


gcagttgccg 


gttttgtcca 


1200 


accaggtcag 


gaggctgccc 


ggccccctcc 


ccactctcaa 


agttgcttgt 


taaacacaga 


1260 


gtcgtaattt 


gtggctaaat 


ataactagtg 


tgttctcacg 


gaaagtataa 


ttcagggtgc 


1320 


tcattgtatg 


aggttatcaa 


caagacccat 


ctgggttaaa 


ttaaagtgat 


tttcataaag 


1380 


ggcagaagcg 


cccctttgcc 


tggtttcccc 


tgccttttta 


ttagtgacag 


tgttattgtt 


1440 


gcaattatta 


gaggggatca 


aggaaggtgg 


gagttgccag 


ggacccccag 


gctgaggagg 


1500 


gagcctccca 


gcccccctcc 


actcaccatt 


gtcctgctgc 


ccctccagcc 


ccctggtcac 


1560 


tgtcacctct 


gcggctggag 


ggctggagaa 


aggcccagga 


gcccagggaa 


cctccaccat 


1620 


gccaggctgc 


cttccctaaa 


ggccggctcc 


ttccccccag 


gcagccagtc 


tgggggtgga 


1680 


cctggagggg 


cagggttctc 


tgggaatctc 


tgggccaagg 


gatgcccttt 


ggggatctgc 


1740 


tggaataaag 


agtaatcggt 


tttccttgca 


ggaggttgac 


ccgaaacagc 


cctccgtctt 


1800 


cctgcccatg 


aggatgagga 


caggaagcca 


gacccgggag 


agaaagccca 


gatcagacag 


1860 


atgggggttg 


ggagggccta 


gtggctgaaa 


ctgccctgct 


tggggacaat 


gctcaggcag 


1920 


gaccaatcca 


aacgagggca 


ctgcttatgt 


ggtccttccg 


gagcatcctc 


caaggcagag 


1980 


cctggcttcg 


ggccctccct 


gcaaggacac 


ctgcctccac 


tccctgccag 


accaggcctg 


2040 


gctgcagagg 


cggaggggcg 


gcagggctgc 


ctgtctccac 


tgtttggcgg 


tgtttacacg 


2100 


tgtcgctgtc 


agggtgtctg 


gacagtgggg 


gtcgggggcc 


tcccgggccc 


agacttcccc 


2160 


acgctcccca 


agagcagcag 


atgtaggtct 


cccggctccc 


aaacctctga 


gatgctgctc 


2220 


tgtctgaaat 


catcatataa 


ataatcattt 


tttaatacaa 


gggggtgggg 


gtgtcaggca 


2280 


agttccatga 


aattgtgaat 


tagccgctgc 


tccaaataaa 


tgcctgcttc 


ggcccgagag 


2340 


cagccgcgct 


gcgcccgctg 


cgcccccaat 


aatcgtgtgc 


tgggtatctc 


tccatttgtt 


2400 



ccctgggcca ttagtccaaa gggcacaatt 



tcttaattct 


tcccaccaga 


ggcgcagggg 


<210> 67 

<211> 247 

<212> DNA 

<213> Homo sapien 




<400> 67 
actgagcagc 


tacggaatgc 


aaggcactgt 


ggctcagggt 


caggcagggg 


aggtgagata 


tcctgagggt 


tgtgaggcac 


aaaatactaa 


tgaggagaaa 


tcaaggaaag 


catggtttta 


actgaag 






<210> 68 

<211> 2458 

<212> DNA 

<213> Homo sapien 




<400> 68 
ttgcgctgtt 


ttgggtgatc 


1 1* a t" era t a c c 


gtgctatctg 


tgcgtcggta 


tatatatcta


gctcgtgatg 


aatgggctgc 


agcatttata


ttgtgggcct 


tcggggattt 




tgaggatgtc 


aggtagcccg 


ttactaatat 


ttagtaatta 


caggaaaact 


aacttataaa 


ctgagcagct 


acggaatgca 


aoacactata 


ctcagggtca 


ggcaggggag 


y u-y y " ^*o.cici 


ctgagggttg 


tgaggcacaa 


cicl i-y L^cty y ct 


aggagaaatc 


aaggaaagca 


■f- rrfit* t~ t* a na 
*-y~J I- 1- 1- Lci^a 


tgaagaggct 


ttttaaagtg 


gagggaaggc 


tcttgtcagg 


aagaacagag 


tccaatttgg 


ataagactgg 


catgggggca 


gagggggact 


actcaagtag 


gtgattgata 


ggtgatcttg 


aacattggct 


taggacaatc 


cctttggctt 


ttgggggaga 


caaggatggg 


aagatgcatg 
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agaaacaatt 


tcttgacgac 


aaaaattgac 


2460 


ggacaagtcc 


cagag 




2505 


aggaagtagg 


gtgagtatac 


tccccacaag 


60 


aaaacccaca 


gccatacact 


agctggcctg 


120 


gagactagag 


aagtaagaaa 


tgtcttgaca 


180


gaacatgtgt 


gggatgtttt 


ttgtatcgag 


240 
247 


gtcgcgcgcg 


tcatgcttgt


tttctcagcg 


60 


ctgcacgggc 


tggtgcgacg 


cgtggtttgt 


120 


cctatggggc 


ctgttcaagt 


atgggggcct 


180 


cgcccgttgc 


ttgtggttgg acggccgctc 


240 


cggaagtttg 


tgcaagattg 


agataaatga 


300 


aatcttaaag 


acattgaatg ggttaatgta 


360 


ggagtagggt 


gagtatactc 


cccacaaggg 


420 


aacccacagc 


catacactag ctggcctgtc 


480 


gactagagaa 


gtaagaaatg 


tcttgacatg 


540 


acatgtgtgg 


gatgtttttt 


gtatcgagac 


600 


aaactgaggc 


atagagatgc 


caataccagg 


660 


ctgcaggata


gggcatatgt 


aggggagggg 


720 


tgaatgtcag 


gtgacagagt 


caaagcttgc 


780 


ggggcttttg 


agcaggaaga gtgataaatc 


840 


tggcgaaatt


gtgactcagg 


taagggacag 


900 


ataaatgggc 


cctgagcttg gcaggcaaca 


960 
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ggaaggcagt 


gcaatgtttt 


gtagaaatag 


acttagcaga 


gcaggcgccc 


agcattcagc 


1020 


gagaccacag 


attcttagcc 


actgtatgac 


ttgaaggtga 


tactaactcc 


cctgaacctg 


1080 


tctcatctac 


aaaatgggca 


tagtagcacc 


tatgtcatag 


ggttgcttta 


tggcaggtaa 


1140 


ttattcttgt 


aaagcattta 


gcacagctct 


gatgacacat 


ggggaatgct 


cagtaaatat 


1200 


taatgtcact 


aattccagtc 


aattccatta 


actggaatta 


attagttgga 


gttaggaagg 


1260 


aaacaggtgt 


cagctgggtc 


gactgaagga 


agcagagatc 


tcaggaagca 


gtacatggtg 


1320 


gtacagtggg 


agcatggcat 


ctgtggggga 


gactgtctgc 


agtgggatcc 


cacctctatt 


1380 


gctggccagc 


tgtgatcttg 


tacaggatac 


ttaacccctt 


tctagctcag 


tttctttagc 


1440 


tgtaaagcca 


gcgcaatgtt 


accatggagc 


tggtatgtga 


aagcaatgca 


tgttaagtgt 


1500 


gtagtatgtg 


cctgatacat 


tgttagtgcc 


cagtaaatgt 


tagttatcag 


gtgaggggtg 


1560 


ggatgtgggg 


ttgagtagat 


ttgcctcgtc 


cactcataga 


ggcaggatgt 


gcaaggaggg 


1620 


gtggagacat 


gcagcaaggg 


attgggcatg 


aaggttttct 


ccaggcccgg 


ggagaacact 


1680 


cggcctggag 


atattcagga 


gtccctggcc 


cagaggagat 


tacaactctg 


cagggatatg 


1740 


agccacgagt 


ggagtgtaga 


cataggccta 


aggaagcaat 


cagacacttc 


cagaatagag 


1800 


gacatttgct 


aagacagctg 


gcctggatgc 


ttcaaaaagt 


atcatgggaa 


aatatgtcgg 


1860 


ggttgttgta 


gactgaaagt 


gactagagtt 


gaaaccaaaa 


gaatgtatga 


accttgatgg 


1920 


gacctggttg 


ttgtcgttgt 


ttttaaagac 


aaacattctg 


gagacaattg 


ttgaatactg 


1980 


attaagggct 


ctgtattaga 


ggctataaag 


aagttactgt 


ttatattttt 


aggcatgatc 


2040 


atcttaatgt 


aatcatgtag 


aacggtgtct 


ttgtgctcag 


gaaatctatg 


tggtaatact 


2100 


caggggtcga 


aatgtcattc 


tatgtacaca 


tgcatggggt 


ggagagaata 


gtaaatctgg 


2160 


caaaatgtta 


acaactattc 


aatccaagtg 


gagagtatgt 


atctgagtat 


tcattgtgct 


2220 


agcctttcaa 


ctttttctgg 


atgtgtggaa 


tttttcaata 


taaaaaaata 


agaaaaaaaa 


2280 


gaggcctatg 


gaaaaactgg 


gaaccagtat 


gcaacaggaa 


aaggaattct 


ttgaaggaaa 


2340 


aacaggatcc 


acctaccaag 


agagcaattt 


caagggagag 


attaagcaca 


caatgaaggc 


2400 


tgcaaataga 


tcatcatatt 


tggtcactgg 


gagattgtgg 


cagcttggaa 


atagtccg 


2458 



<210> 69 

<211> 894 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature
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<222> (341) . . (341) 
<223> a, c, g or t 



<400> 69 
gatgataatc


atataggcca 


atggtgactc tagatctgct


cgagcgggcg 


cagtgtgatg 


60 


aatcataatc 


acaaca acrcft 


actccgctac tcttttactt


cttttgtaaa 


gtattgactc 


120 


taaaaaacta 


cagtatacaa 


agtctcaaca tgttttttaa


aagaaataag gagcaagcga


180 


r* t* or c c c fc or c* fc 


aaaaatcaca 


aaccgatttt tgtagaatat


tttgtgcccc 


aggcattaat 


240 


t* t* ractaant 


ccagaacctg


cagttcagag aatgatttct


tatgatgata 


aaaatcgaat 


300 


>4 y u ^ ^ Gl >H Ci ^ 


cfafccicrfcfcfccrc 


atttttttta attaacttgg 


naataggaca 


cctcaagttt


360 


araorafcfcfccafc 


tttcttttaa 


aacacagtca caagattaat


ctggtgaatc 


cttttgtcac 


420 


aoat fc c fc ccra 


L y L y "-y ^y u y 


cgcgtctccg tgtgtgtgtg


tgtgcatgtg 


tgtaaaactg 


480 


gtcacattta 


attgcttttt 


gggaccattg aatagttggg 


aagtaagaat 


tttttaattg


540 


gcattaaaac 


ggtcctcaac 


tgttaaatta accaaatttg 


acctgtcttt 


aaaaaaaggc 


600 


ttatttgtat 


gattttgggc 


taactccccg gggaccatat 


taaatgacaa 


aaatgctcct 


660 


ttgggtgaca 


caccctacaa 


agtatttgct cttacaaaca


taaacgccca 


cattcttaat 


720 


atctaatatt 


tttgaccagt 


gatgtcttat gctgtcatct 


gaacctagag 


aagcagtgtc 


780 


agaggaaacc 


tggtgtccat 


gtgtcttagc aaagggtact


gatcgagggt 


ctgtgacaaa 


840 


gatggcccga 


gagctggggt 


atcatggcat agtgttctgg 


gtaatgttcg 


ccac 


894 


<210> 70 

<211> 1335 

<212> DNA 

<213> Homo sapien 










<220> 

<221> misc_feature 
<222> (278)..(278)
<223> a, c, g or t 










<400> 70 

cgtggtcgcg gcgaggtact 


ccgctactct tttacttctt 


ttgtaaagta 


ttgactctgg 


60 


aaggctacag 


tatacaaagt 


ctcaacatgt tttttaaaag 


aaataaggag 


caagcgactg 


120 


ccctgctaga aatcacaaac 


cgatttttgt agaatatttt 


gtgccccagg 


cattaatttc 


180 


actgactcca gaacctgcag 


ttcagagaat gatttcttat 


gatgataaaa 


atcgaatggg 


240 


atcagacgat 


ggtttgcatt 


ttttttaatt aacttggnaa


taggacacct 


caagtttgag 


300 


atttcatttt 


cttttagaac 


acagtcacaa gattaatctg gtgaatcctt


ttgtcacagt 


360 
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tctcgtgtgt 


gtgtgcgcgt 


ctccgtgtgt gtgtgtgtgc atgtgtgtaa aactggtcac 


420 


atttaattgc 


tttttggacc 


attgaatagt tgggaagtaa gaatttttta attggcatga 


480 


gacggttcct 


caactgttaa 


attaaccaac tttgacctgt ctttagaaaa aggcttattt 


540 


gtatgatttt 


gggctaactc 


cccggggacc atattaaatg acaaaaatgc tcctttgggt 


600 


gacacaccct 


acaaagtatt 


tgctgttacg aacataaacg cccacattct taatatctaa 


660 


tatttttgac 


cagtgatgtt 


ttatgctgtc atctgaaccc tagagaagca gtgtcagagg 


720 


aaaccttggt 


gtcacatgtg 


tcttagcaaa agggttacca tgatcgaggg tcatgtgacc 


780 


aaaagatgct 


ccagagaagc 


ttgagaattt gtttcaagtt gggaggaggg ttggagatac 


840 


aaaaatcact 


ctgctctaca 


ggactcttca gctgtctatg caagaaattc cgttttctct 


900 


ttcagcacct 


ggaaagacac 


agcagcccac tgaggcgata ggtgattcac taagcacaag 


960 


aggaatgttt 


tctaagcaag 


gcgtcccttg cctctcaaac aaatgccctc caagtttgtt 


1020 


agggtttcta 


ttcctgcaac 


ttgtggtatc aaaaccactt cctgaaattg tcaaagcact 


1080 


gccaaaataa 


atgtttttcc 


cccttctaag aaaaaaaaaa tgacagtgct catatttgac 


1140 


acttgtgtat 


tggactctct 


tttgaatgaa taaaaaggaa aaggggtttg gtgtaattcc 


1200 


tgatggggtg 


cgtgttgttt 


ttcatgccat ggtttgtgaa ttttaattgt ggtttcccat 


1260 


ttcgttgttg 


taactgggca 


gaaattaaaa aagaaaaatc aataaaaata caaagaaatg 


1320 


gttaaaaaaa 


aaaaa 




1335 



<210> 71 

<211> 137 

<212> DNA 

<213> Homo sapien 

<400> 71 

cgggcaggta cgcacaagca ggcccagctg ggagacgcta ggataactaa gacctggtca 60 

gcgccctgag gagtcttgtc tggataaagg gagacacaca ctagcttgga cttatgcgtg 120 

cagcggagct gtgtaga 137 

<210> 72 

<211> 694 

<212> DNA 

<213> Homo sapien 

<400> 72 

ccgcccgggc aggtaccaac cccagcacac cccaacagcc tttcctcggc ccctcctcag 60 
gcctcctaat tactctttct cagcctggag tgtggggccg ttaccgtcct cttccccctt 120 
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ctccttccat 


actgcactta 


accttgctgg 



aagacttaat 


gatggagatt 


tagggcaatc 


180 


tgtggctgct 


tggacccttc 


cctggaccaa 


aggaacttaa 


aacccaaacc 


tgacactgga 


240 


atgaaatcca 


agtttttaaa 


tatcaccttt 


caatcactca 


cagatctcac 


tctatcttaa 


300 


aatactcagc 


ctcactcctt 


aatgagtgct 


tgctgaaggg 


agaaaattcc 


attttaaaaa 


360 


cgtattcact 


ttactgatta 


ctgtgcaatt 


tgaattaagt 


cacgattctt 


tagtaaatgg 


420 


aggtgagaat 


ctcagattca 


aattgtcaga 


gaccatgatt 


tagaagtcta 


ccaaacaccc 


480 


agtttccttc 


cactgtttta 


qqqtaacaqq 


aaaacatgag 


attggggtgg 


tgtccgctat 


540 


taaatggaac 


cacacatcat 


gaaattcaat 


tctcatgtta 


agacattctg 


tattgtggga 


600 


tgtcaaaagt 


atttcccaaa 


ctttcgtttg 


acctgcagag 


ttggagatgg 


cttacctccc 


660 


tataacttca 


agtttgtttc 


acaaagcttg 


gcgt 






694 


<210> 73 
<211> 8095 
<212> DNA 
<213> Homo sapien 












<400> 73 
tttttttttt 


gccacctaga 


gatgataatt 


tattgtttta 


ccatgactca 


gaagagaaac 


60 


aacataaaga 


gaatatttca 


aatccccaca 


atttccttcc 


tcaacctcac 


tactcttaac 


120 


atttctttat 


cagacgccac 


tggcttccta 


aaatggacca 


ctgactatgt 


atgtgtacac 


180 


atttcattat 


gctgcctttt 


ctcttatgat 


taaaacttta 


gccctcattc 


gaggtttcca 


240 


atggttactt 


ttagtggagg 


agttccctag 


cttttaaaaa 


accacttttc 


ctctaagatt 


300 


ccattattta 


ttgaaagaag 


tctttctaga 


aatgttaagg 


aggattttaa 


atgaacacat 


360 


tcaattaaaa 


aaaaaatcac 


gtattgaaca 


tctaccaagc 


atctggactc 


ttcggaacct 


420 


agtaaaatga 


aaaaatccag 


ttttaacaac 


agtaacttca 


ttctgcgggt 


atacagagac 


480 


aagcacgttt 


cttcttttgg 


tctaatttat 


tctaaacgaa 


gaagctggga 


actgacaaaa 


540 


caggacaggt 


tgtttttaat 


ccagtctaca 


aataaacaag 


acaatgcctg 


agttagccct 


600 


ctatatagat 


ttaggcttat 


gctgacctcg 


ttgtaaaatc 


tgtatttaac 


taaaagttaa 


660 


taaaaataca 


tatgttcatt 


ttaaaataat 


tactgatttt 


gcttggctat 


cccacccctt 


720 


acccccaaac 


tcatatattt 


ttaggacaag 


attttcctgc 


ataaccacaa 


cctgtctcct 


780 


cccaccccac 


ccccatcata 


gatgttttca 


aataagaacc 


cctgcgatca 


gcagaagcat 


840 


ctctaatcta 


acatgctttg 


tccttgctag 


ggcaggctaa 


aagctttaaa 


aagcaaccgg 


900 


atgctcttct 


ctggttgagg 


tgaggggaag 


gcgctcgggt 


accaacccca 


gcacacccca 


960 
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acagcctttc ctcggcccct 


cctcaggcct 


cctaattact 


ctttctcagc 


ctggagtgtg 


1020 


gggccgttac cgtcctcttc 


ccccttctcc 


ttccatactg 


cacttaacct 


tgctggaaga 


1080 


cttaatgatg gagatttagg gcaactgtgg ctgcttggac ccttccctgg 


accaaaggaa 


1140 


cttaaaaccc aaacctgaca ctggaatgaa atccaagttt ttaaatatca cctttcaatc 


1200 


actcacagat ctcactctat 


cttaaaatac 


tcagcctcac 


tccttaatga gtgcttgctg 


1260 


aagggagaaa attccatttt 


aaaaacgtat 


tcactttact 


gattactgtg 


caatttgaat 


1320 


taagtcacga ttctttagta aatggaggtg agaatctcag attcaaattg tcagagacca 


1380 


tgatttagaa gtctaccaaa 


cacccagttt 


ccttccactg 


ttttagggta 


acaggaaaac 


1440 


atgagattgg ggtggtgtcc 


gctattaaat 


ggaaccacac 


atcatgaaat 


tcaattctca 


1500 


tgttaagaca ttctgtattg 


tgggatgtca 


aaagtatttc 


ccaaactttc 


gtttgacctg 


1560 


cagagttgga gatggcttac 


ctccctataa 


cttcaagttt 


gtttcacaaa 


gctttgaaaa 


1620 


gtaaaacaga taatttcatt 


ttcagataat 


aaaaaatctg 


aatagcaaaa 


taattgcttt 


1680 


taaatgtagt gtgtccactc 


taaaaaaaaa 


aaccctaaat 


ctatgttaga 


aaaacttttc 


1740 


aaataatgcc ttttattaaa 


ttctccagta 


gtagttgaaa 


taaaaatcta 


ccctaatttc 


1800 


tatgaaatga tctatttata 


tcactgactt 


ttctttttct 


ctgattctat 


atttcattta 


1860 


acaatctgca gactttcacc 


ccatttccca 


gatgggaaaa 


ccctagcccc 


ctcgtatctc 


1920 


tgagaagttg ctcagagtag 


gacacagaga 


aatatggccc 


ccaccctggg 


aagtactgct 


1980 


gtcactgttt aagtgtattt 


cagttctgtt 


actccaattc 


atacacacag 


tcttccatga 


2040 


ggatggtagg atgaacctgg ttagctggct ttggataagt agatcagcat gactacctgg 


2100 


aataaaagtg actgactcta ggataaaaat taaaaaaaga ttctttcaca gcaacgagtc 


2160 


tttgcaaaac ctctctccta 


ataatcacaa 


accaggggaa gaaaagtggg 


agcagggaac 


2220 


acaggaacac agccaaaggg 


aatattgcaa aatgcttccc gagcttcatc 


agacagactt 


2280 


cttgcaatgc cacgactgga 


tgcatctgca 


cacaattccg ggaaatgccc 


accttgctgt 


2340 


tctcctatcc ccaattttct 


ttctttcttt 


ctttcttttt 


ttcttttttt 


tgagacagag 


2400 


tctctgttac ccaggctgga gtgcagtggt gcaatctcgg ctcactggca acctctgcct 


2460 


cccagattta ggtgattctc 


ctgcctcagc 


ctcctgagta gctgggacta 


caggcgccca 


2520 


ccaccacgcc cggctaattt 


ttgtattttt 


agtagagaca 


ggctttcacc 


acgttggcca 


2580 


ggctggtctc aaactcctga 


cctcaagtga 


tccgcccacc 


tctgcctccc 


aaagtgctga 


2640 


gattacagga gtgagccacc 


gtgcctggcc 


agtcctatcc 


tccccccaac 


cttttttttt 


2700 


tttttgacat ggagtctcac 


gccatcaccc 


aggctggagt 


gcagtggcgc 


catcttggct 


2760 
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cactgcaacc 


tccgcctccc 


atgttcaagt 


gatcctcctg 


cctcagcctc 


ctgagtagct 


2820 


gggactacag 


atgaagacaa 


gcacctgtgg 


tgcccctcac 


tgcaagaagt 


cagggaggca 


2880 


ttccacagcc 


tgggtgccca 


cagtcctgcc 


ctgtaccctc 


tggggccctt 


ttggcacggt 


2940 


ggcagcgctt 


ccagatttcc 


tttcagaaag 


atgcagtcct 


tccgcagctg 


ggccatgcag 


3000 


atctcctggc 


atccaacttg 


ttggaaacag 


tggccctctg 


ctcaatgatg 


agactgggga 


3060 


ggggaaacag 


gaggacattt 


caaggaaagg 


atggagcgtg 


gatatgaatg 


ggaagcgggt 


3120 


ggtgggggag 


ctcatcagca 


tcctcagagg 


ggctcatgcc 


cacgctgcaa 


cacagaatgg 


3180 


gacttgccag 


atgtttgtag 


tcgactctga 


gtgccccgtg 


ctgagaaacc 


tgaaagcaca 


3240 


cccacctatg 


gctgcgcgtg 


ttgcacgctc 


aaggctgagt 


tcacatagtt 


ctgtagcctc 


3300 


ctcctacacc 


aagtcaggtc 


ggccctgtgt 


gaccagtaga 


agagatggga 


tgtcactttc 


3360 


gagggtgctt 


ccaggcgagg 


ctggcctgaa 


tgagaatgag 


gagcaggacg 


ctccccaaga 


3420 


gattgccttg 


gacatcagcc 


tgggccacat 


ctacaagttc 


agacccattc 


agcagctaaa 


3480 


ttccaggagc 


atcacggaga 


atctccggcg 


agctcagcac 


caaggcaggc 


actgtgctgg 


3540 


ctgtgggccg 


gatgctacag 


aaactgtgtt 


atcggacacg 


ggtcctgatc 


taggccccaa 


3600 


gagagagttc 


ttgtacaaga 


aagaattggg 


gccaggcatg 


tttctggcgc 


tgtgtgccca 


3660 


ggcccagccg 


ggggcctaca 


ctgatgagaa 


cctcatggga 


ctgattgagc 


tgctgtgccg 


3720 


caccagcctg 


gacgtggggc 


tccgcctgct 


gcccaaagtt 


gacctccagc 


agcttctcct 


3780 


cttgctcctg 


gagaacatcc 


gggagtggcc 


agggaaggcg 


cttccttcca 


ggacagatgt 


3840 


cccacggctt 


gcagatggct 


gggcccagga 


gacggtgcta 


gcccttcctc 


tgagagaagg 


3900 


ggtgcaggct 


gccgccaccg 


tgcccatcct 


cctgtacaac 


ctggaggatg 


gcttgtcaga 


3960 


ccatcccctg 


gaccagggcc 


ccgctgccct 


gcccggcggc 


cctgcagccc 


tgcctcggct 


4020 


ccagctctca 


catcgccaaa 


gaagcccaag 


atacaggcac 


ctggggaaac 


cgttgacggt 


4080 


ttttgctgtt 


ccagttttgg 


ctgcttctca 


agatcacgaa 


gcccagggca 


ctctgcaagg 


4140 


gtttctgcaa 


gttcagcagt 


tcatcgctgg 


agggcgcttc 


cttccaggac 


agatgtccca 


4200 


cggcttgcag 


atggctgggc 


cccaggagac 


ggtgctagcc 


cttcctctga 


gagaaggggt 


4260 


gcaggctgcc 


gccaccgcgc 


ccatcctcct 


gtacaacctg 


gaggatggct 


tgtcagacca 


4320 


tcccctggac 


caggggcccc 


gctgccctgc 


ccggcggccc 


tgcagccctg 


cctcggctcc 


4380 


agctcccaca 


tcgccaaaga 


agcccaagat 


gcaggcacct 


ggggaaacgt 


ttcccactga 


4440 


ctggagcccc 


ccgcccgtgg 


aattcctcaa 


cccgagggtg 


ctgcaggcca 


gtcgggaggc 


4500 
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cccggcccag 


aggtgggtgg 


gtgtggtggg 


cccccagggc 


ctgaggagac 


tggctggtga 


4560 


gctgcccgag 


gagttggagc 


aggaacacct 


ggacttggac 


ccgaagaggg 


gcctggcctt 


4620 


gccagagaag 


ctgttctgga 


acacgtcagg 


cctgagccag 


caggctgcgg 


ccccagagtt 


4680 


ttcctggggg 


ggctcaggaa 


gctacttcaa 


caacctggac 


tacttactgc 


aggagaagag 


4740 


ggaacaggcc 


ctggagcagg 


agcgagagag 


gctgcttctg 


caggagtgtc 


tcaatctcaa 


4800 


ctccttggat 


cttgatgaag 


aggaagtgcc 


actcacaccc 


gagcacagaa 


agaggcaaga 


4860 


gagctctctg 


gggccctttc 


ataagggtac 


caatcctatt 


catgaaggct 


ccaccctcat 


4920 


gcctcatcac 


ctcccaaagg 


ccccacttcc 


taataccttc 


accctggggc 


tttccttccg 


4980 


gagacaagca 


gtaaataaga 


tcagtgaagt 


tgtgctgcaa 


gggctcctga 


gaaaggctaa 


5040 


cgctgggggc 


ataaggagtg 


ctgggaaagg 


tgtgggctct 


gatgatgtgg 


gctctaatca 


5100 


tgtgggcttt 


gatgatgaag 


gctctgatga 


tgaagggatg 


ctggtggaaa 


agtactcagt 


5160 


gtccctgcag 


accatcccgc 


cggtccatcc 


aggtgagact 


gtgtttctgc 


ccaggtgtca 


5220 


ccccctgcca 


tgcatcctgg 


actcctcact 


cctgaagcca 


cgcagccacc 


tggaagggct 


5280 


gttcctcagg 


cagtatgctg 


agcattggga 


cctcaaggat 


gaggaagatg 


cagtctctgc 


5340 


cctagaggag 


cttacagcag 


caggaagttt 


ctgtcatagg 


acagacccag 


ggctcaccaa 


5400 


gactcaagca 


gatgatgaag 


cctggggctc 


actggcccaa 


tcagcgtatt 


cagactggct 


5460 


ggctgcttat 


gaggctcttg 


ggccagggct 


gcctgctcag 


tgggcagctg 


actctagctg 


5520 


ctgcaaaatg 


cctttcactc 


aaaggttttt 


gcttttgcca 


atcccctccc 


tcacatgcct 


5580 


tgaaacaagc 


actttcaaag 


acaaagacat 


aaacaacaaa 


agggtgcagg 


ctgagttgcc 


5640 


aacttacagt 


gtcattgggc 


cgattcaggt 


tcttgactgc 


tgcacaaaag 


aatttgagag 


5700 


caagtacaaa 


gcaaaagtag 


gtaaagaagt 


ttattgcaaa 


gcgaagatct 


cctgggaggc 


5760 


ccccgtggag 


aagaagactg 


agtgtatcca 


gaaagggaag 


aacaaccagg 


tgggtgcttg 


5820 


gacgctgctc 


ctggtgctgc 


cttcacccca 


ggacgtctcc 


tcccattctg 


gccctcgcgc 


5880 


tctcactaac 


cggacacctt 


tctgccccca 


gaccgagtgc 


ttcaacttca 


tccgcttcct 


5940 


gcagccctac 


aatgcctccc 


acctgtacgt 


ctgtggcacc 


tacgccttcc 


agcccaagtg 


6000 


cacctacgtc 


aacatgctca 


ccttcacttt 


ggagcatgga 


gagtttgaag 


atgggaaggg 


6060 


caagtgtccc 


tatgacccag 


ctaagggcca 


tgctggcctt 


cttgtggatg 


gtgagctgta 


6120 


ctcggccaca 


ctcaacaact 


tcctgggcac 


ggaacccatt 


atcctgcgta 


acatggggcc 


6180 


ccaccactcc 


atgaagacag 


agtacctggc 


cttttggctc 


aacgaacctc 


actttgtagg 


6240 


ctctgcctat 


gtacctgaga 


gtgtgggcag 


cttcacgggg 


gacgacgaca 


aggtctactt 


6300 






cttcttcagg 


gagcgggcag 


tggagtccga 


ctgctatgcc 


gagcaggtgg 


tggctcgtgt 


6360 


ggcccgtgtc 


tgcaagggcg 


atatgggggg 


cgcacggacc 


ctgcagagga 


agtggaccac 


6420 


gttcctgaag 


gcgcggctgg 


catgctctgc 


cccgaactgg 


cagctctact 


tcaaccagct 


6480 


gcaggcgatg 


cacaccctgc 


aggacacctc 


ctggcacaac 


accaccttct 


ttggggtttt 


6540 


tcaagcacag 


tggggtgaca 


tgtacctgtc 


ggccatctgt 


gagtaccagt 


tggaagagat 


6600 


ccagcgggtg 


tttgagggcc 


cctataagga 


gtaccatgag 


gaagcccaga 


agtgggaccg 


6660 


ctacactgac 


cctgtaccca 


gccctcggcc 


tggctcgtgc 


attaacaact 


ggcatcggcg 


6720 


ccacggctac 


accagctccc 


tggagctacc 


cgacaacatc 


ctcaacttcg 


tcaagaagca 


6780 


cccgctgatg 


gaggagcagg 


tggggcctcg 


gtggagccgc 


cccctgctcg 


tgaagaaggg 


6840 


caccaacttc 


acccacctgg 


tggccgaccg 


ggttacagga 


cttgatggag 


ccacctatac 


6900 


agtgctgttc 


attggcacag 


gagacggctg 


gctgctcaag 


gctgtgagcc 


tggggccctg 


6960 


ggttcacctg 


attgaggagc 


tgcagctgtt 


tgaccaggag 


cccatgagaa 


gcctggtgct 


7020 


atctcagagc 


aaggtaaagc 


tgctctttgc 


cggctcccgc 


tctcagctgg 


tgcagctgcc 


7080 


cgtggccgac 


tgcatgaagt 


atcgctcctg 


tgcagactgt 


gtcctcgccc 


gggaccccta 


7140 


ttgcgcctgg 


agcgtcaaca 


ccagccgctg 


tgtggccgtg 


ggtggccact 


ctggatctct 


7200 


actgatccag 


catgtgatga 


cctcggacac 


ttcaggcatc 


tgcaacctcc 


gtggcagtaa 


7260 


gaaagtcagg 


cccactccca 


aaaacatcac 


ggtggtggcg 


ggcacagacc 


tggtgctgcc 


7320 


ctgccacctc 


tcctccaact 


tggcccatgc 


ccgctggacc 


tttgggggcc 


gggacctgcc 


7380 


tgcggaacag 


cccgggtcct 


tcctctacga 


tgcccggctc 


caggccctgg 


ttgtgatggc 


7440 


tgcccagccc 


cgccatgccg 


gggcctacca 


ctgcttttca 


gaggagcagg 


gggcgcggct 


7500 


ggctgctgaa 


ggctaccttg 


tggctgtcgt 


ggcaggcccg 


tcggtgacct 


tggaggcccg 


7560 


ggcccccctg 


gaaaacctgg 


ggctggtgtg 


gctggcggtg 


gtggccctgg 


gggctgtgtg 


7620 


cctggtgctg 


ctgctgctgg 


tgctgtcatt 


gcgccggcgg 


ctgcgggaag 


agctggagaa 


7680 


aggggccaag 


gctactgaga 


ggaccttggt 


gtaccccctg 


gagctgccca 


aggagcccac 


7740 


cagtcccccc 


ttccggccct 


gtcctgaacc 


agatgagaaa 


ctttgggatc 


ctgtcggtta 


7800 


ctactattca 


gatggctccc 


ttaagatagt 


acctgggcat 


gcccggtgcc 


agcccggtgg 


7860 


ggggccccct 


tcgccacctc 


caggcatccc 


aggccagcct 


ctgccttctc 


caactcggct 


7920 


tcacctgggg 


ggtgggcgga 


actcaaatgc 


caatggttac 


gtgcgcttac 


aactaggagg 


7980 


ggaggaccgg 


ggagggctcg 


ggcaccccct 


gcctgagctc 


gcggatgaac 


tgagacgcaa 


8040 
actgcagcaa cgccagccac tgcccgactc 



<210> 74 

<211> 435 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (86) . . (86) 

<223> a, c, g or t 

<400> 74 



ggatgttcga 


tcactatagc 


cttgtgcctc 


gatgtcgcgg ccgaggtact 


tcccanggtg 


gcaacttctc 


agagatacga 


aaaaactaaa 

33333*-' oa '3 3 


ctgcagattg 


ttaaatgaaa 


t~ 3 1~ a an a t~ r* a 


gatcatttca 


tagaaattag 


ggtagatttt 


aaggcattat 


ttgaaaagtt 


tttctaacat 


acactacatt 


taaaagcaat 


tatttatact 


ttatctgttt 


tactt 




<210> 75 

<211> 608 

<212> DNA 

<213> Homo sapien 




<400> 75 
ggggggcata 


tttctctgtg 


tcctactctg 


ggttttccca 


tctgggaaat 


ggggtgaaag 


gagaaaaaga 


aaagtcagtg 


atataaatag 


atttcaacta 


ctactggaga 


atttaataaa 


gatttagggt 


tttttttttt 


agaggggaca 


cactatcccc 


aaagaaggag 


ctgtggggga 


gcatattctc 


atctcgggcc 


ccgaggctgc 


gagacagatc 


acagaccacg 


agtgcctttc 


accagctctt 


tccctctcta 


gacagaaata 


catgaagtac 


gaggaaaact 


tgaattccag 


ccagaaat 
caaccccgag 


gagtcatcag 


tatga 


8095 


tagatgctgc 


tcgagcggcg 


cagtgtgatg 


60 


ggggccatat 


ttctctgtgt 


cctactctga 


120 


gttttcccat 


ctgggaaatg 


gggtgaaagt 


180 


gcagaaaaag 


aaaagtcagt 


gatataaata 


240 


tatttcaact 


actactggag 


aatttaataa 


300 


agatttaggg 


tttttttttt 


tagagtggac 


360 


atccagattt 


tttattatct 


gaaaatgaaa 


420 








435 


agcaacttct 


cagagatacg 


agggggctag 


60 


tctgcagatg 


ttaaatgaaa 


tatagaatca 


120 


atcatttcat 


agaaattagg 


gtagattttt 


180 


aggcattatt 


tgaaaagttt 


ttctaacata 


240 


cactcccttc 


ctactcccca 


acaaggggct 


300 


cccacgacgc 


agccccggta 


cgggattaca 


360 


ctgtggggcg 


aggggagacc 


tcccatcacg 


420 


ccggaccggg 


acgtggcctc 


cagagcaggc 


480 


ttttggtaag 


gttctggggc 


agggagggag 


540 


attcttaatg 


caaagtattt 


atcatttcta 


600 



608 






<210> 76 

<211> 727 

<212> DNA 

<213> Homo sapien 



<400> 76 
gatgattcga 


ctcactatgg 


gcgaatgtgc ctctagatgc 


tgctcgagcg gcgcagtgtg 


60 


atgggtcgcg 


gcgaggtact 


tcctagacac tgtagtttaa 


aagaataatt tcctgtagaa 


120 


aaacctaatc 


taataggacc 


tactcataat ttggcaaaat 


catcctcata attccatgct 


180 


actgaaatgg 


gggtgtaata 


aagaaaaaat catttgttta 


ttcatttaac agggtttatt 


240 


aaagcaccaa 


tatatgccca 


gaactttgca gtgcactgga 


agttcagaga taagaaagcc 


300 


acagtttctg 


ccatcaaaat 


gctaggaatc tagaggtctc 


aaacatcaga ggtcccttca 


360 


aactatgtct 


accaggaaaa 


taaaatacct gatttcatcc 


ctgcaaacaa ctctatacag 


420 


acagaaatgc 


ttcctagctt 


ggaaactata gaaaattgaa 


catgaaaatg agagcaaaga 


480 


ttctacatca 


aaatggcaat 


gatccaatca gcccagtcaa 


ggctgagtgg gttgaatggg 


540 


gacttcgagt 


ttggattcaa 


tgctttgaac ttcatagctc 


cagagaagct gtgcaaaaag 


600 


gggggattct 


agggaatctg 


aaaaaaatto ttacraaaaac 


cagctttctt cttgtctcct 


660 


aatccctgct 


aagattgata 


ccagtcaagg agctgagtta 


ctttcatttc attatgttga 


720 


ccaagag 








727 


<210> 77 
<211> 3052 
<212> DNA 
<213> Homo sapien 








<400> 77 

ggaggtggag gggaggagtg 


cctgcatggc catgggattg 


ttcttcattc cctttctcaa 


60 


ctgcacccag 


cagcagtggt 


ttttgctagg ccttttgaag 


acagcaggaa tctgggagaa 


120 


ggaacatcat 


cgtctttcac 


agcatggaaa catcaatctt 


attccagaga agggaagaag 


180 


tccccaaagg tatgtccggt 


ttaacagttt ctcaagtggg 


ccaggaagtt ctttttcatg 


240 


ttctgggctc aatcgtgatg 


ctttgatttc acttggtatt 


ttacttttag ttttgtctct 


300 


aacatctgga gcaaagatca 


gaagacctga gttccagatt 


tattctgtga ctcaatcact 


360 


gcttcaatca 


ctgagggacg 


tggtatgatg ttctttgctc 


tgagcttctg cttcttgatc 


420 


taaaaagaca ggtggttgtt 


ctgggtgatc aaaatctctt 


cttgctctgt gtatcttaga 


480 


atccatgaaa 


ttatgtctct 


ttgcatctat gataacaaat 


caaaatatga atcacaattt 


540 
t~ p t~ t~ a pa nnn 


i»tyu ^yayyu 


w a a a y *•>>• w ^-j **y 


dday \*\* uy y y 


aacaaatctc 


cccaaaoaaa 


600 


crt* hhaa ptp a a 


y I* a L. L> aau^ ut> 


tttcttctae 


oay l> k,uu v 


at teat tact 


aaaatattat 
a yyy *-y uu y u 


660 


I* L>ya I- ay K~ \*rGL 


\*\-,gl u*y v^a^^ay 


hhaactttrc 


vVsyyyaaauw 


ataaotcttt 


tatttttgea 


720 


att taact t - 1 


at~pt"ppt"apa 


rafra t* ppari 
v~>a u.Up>a uwv>bh 


ai*crCTt"ar , a a c 

a i>y y ty i*aa\s> 


ate taa tact 


atatotccaa 

VL ^p>CA ^p*>J w «kp> V«p Up4 WpV 


780 


tcrcraat tcrcc 


acaactcaoc 


cactoaoatc 


c aac at t c c t 


gcctctcttc 


attccaagat 


840 


tccttctgtt 


ctttattata 

*w U* Up U* C^J. U- 


gtagacaaga 


gcaagcattt 


ttatttgcag 


agatgtactt 


900 




c a a t - t crt* a a a 
uaa l> u»y l. y act 


aeicatttcfat* 


ttaataatta 


crcrattatttc 

y y a uv-y v«wwv* 


tccacaattc 

Sp- W ^pP Vp*^pJ LpI> VpP VpP \pP> 


960 


tttttLtyaa 


aaaaaaH*fh 


V f- nn t~ pt t- a a 
u uuyy uy tao 


a +- f- a era rrh +* 
dtLL ay ay u u 


pah a arrf- a a r* 


arraaaaact" 

aL> Up* aaay av> l* 


1020 


y ay a^cia uy a 


ant* prahnna 
ay LLLdLyk>a 


r* a r"t"fft*aa a a 
LcLL>i>y u.aaaa 


aaat*aar*at*a 
dddUddLdLd 


aat*aaraafp 

a a i» a a s» ay \* 


cctacftatac 


1080 


al-f<a <-rt* p a i~ t* 


nhart^a rra tier 
y Lai* L-ay &yy 


ct ttcrtatcrt 
l. u. wy i-y ty t- 


a cat tcratto 


tctcacttaa 


aaatcaaaac 

t* C* C-k d 


1140 


sroaracraacr 
33 y ^••■y 


a atraaahor 

ay u» waac* tyw 


ttttaaatat 


crctcra.aa.ctc 


aaattttctt 


ccaggtcatc 


1200 


tchatatcrat 


tcctctttta 

U« U* \* W W W W w w CL* 


cactgccatt 


actggcttct 


aaaaccacta 


aagctatgea 


1260 




irrhaa pa tcr 

L>Va>Vi> i>aciwc*. u»y 


ttctaaacha 


oacattaofca 


aaaacrc tcrc c 


tgctggcttg 


1320 


ffha ptpjpjytp p 

t uuayyyy 


arrt" t~ ptpt p f" p a 
cty l. tyyu L.v*a 


yii **yy ac *^y 


y LULaaLauv^ 


a err 1 aac a at - a 

ay \~r ci cl Vp«* a cl u» ci 


cr c a 1 1 cr cr cr 1 1 

y u* a l» Vp>y yy u l- 


1380 


nahht* na aha 
ydt U L.ydCtUct 


^aaapaa^H* 


a r*ra phhhaaa 
ayau L. tLaaa 


a rrf~ f pat*ppf 
ay LUL>aLL(>t 


rraa a a a paap; 
yaaaaa^aay 


ppi"i"paacffla 
out Lvaayya 


1440 


ra anf na nna 


ra^naanraa 


diiu^ uaayy 




l"|*pa crria a nr 
l> Lv^ayyaay^i 


aaacraacraat 

aaay aay aa u- 


1500 


ctttacrttat 


f- pa r-rra aaap 
u>u^ a i>y aaaau 


raaataccacf 

waaai»av^x< ay 


(m> bciv>y t»yy^a 


t c 1 1 c t Cf c/ era 

IpX* v»i».Wi«.yyya 


aaacacaaaa 

"■"3 ^3 3 3 


1560 


rrtTraoacia a t 

3*333 3 


pp t- aaaot" o t~ 
« i, yyy3 t 'y * 


atttaacaac 
~h ^ »33* a y » 


actgtccaaa 


gtacagttga 


cttcttaggc 


1620 


t*nrhaaaraa 
uy L>uy uQitcia 


a t ttcttctr 


hhorrrraao 

v vyvVrfV^wayy 


acraatt tcra t 


ctcrcaaat tc 


ccataaataa 


1680 


ay l~ d ci cl L- 1» U 


t* t* p t* p f~ f~ trra a 
l_ L-t— L. Lyaa 


aL>ctyy uyi» uy 


L>y uuaaay 


♦"nfa^pahaa 


crct* t" pi - p t~ t" n 
yVp^i«LrV^L>v«*u<iU«y 


1740 


oaa fah a a 
y UCctclL-ct Uctcl 


4— /— * a a a I - rra a a 

tyooatyaaa 


y Laat L^ayu 


4- r 1 r 1 1~ hoarhn 
t^LLtyaLuy 


yLdu^adit u 


4- a rrpariCTcra t" 
Layuayyyai> 


1800 


Lay y ay au aa 


fraanaaaapt" 
yaayctaayL L. 


on i _ t"t*r , +"r , r'a 
yy UL> l> \» l> v_« a 


acaatcthrr 


t*paciai*l*ppp 
l» w ay aLucv^i 


u> ay aa u» u u^ u»* w 


1860 




pa*pfpi~t"pt*pt* 




a crt" hraaa erf 
ay L^oaaay 


a t~ +* era atrra 
au L*y a a u v«. ^ a 


a a p t" poa a ot" 
ciciUpP Up u^y aay l» 


1920 


U^l^Uu^aipLpV^aa 




u l. iyau uyy y 


f t a a 1 1 Cf a a 1* 


r» a t* t" <to pa i*t" 
LaL L»y v_» L» a i» Lp 


t" tcr at"a t" acra 
Up L-yauy u. ay a 


1980 


atetttyuue 




rrl-4-r<aa 
ycuLaaitui 


pt'a^anfU'p 
LLdt ay 


LddyL tdyyd 


•a Cl f a t" p t" PT 
dy (^dLLuC L.y 


2040 


tctgtataga 


gttgtttgca 


gggatgaaat 


caggtatttt 


attttcctgg 


tagacatagt 


2100 


ttgaagggac 


ctctgatgtt 


tgagacctct 


agattcctag 


cattttgatg 


gcagaaactg 


2160 


tggttttctt 


atctctgaac 


ttccagtgca 


ctgcaaagtt 


ctgggcatat 


attggtgctt 


2220 


aataaaccct 


gttaaatgaa 


taaacaaatg 


attttttctt 


tattacaccc 


ccatttcagt 


2280 


agcatggaat 


tatgaggatg 


attttgccaa 


attatgagta 


ggtcctatta 


gattaggttt 


2340 
ttctacagga 


aattattctt 


tcaaactaca 


gtgtctagga 


agtaccttgc 


acaaaataga 


2400 


cacaggatgc 


ataattgttg 


attaattagt 


tcattttaac 


ctatacacct 


tggggttatc 


2460 


tagttatgtt 


ttcatgaaaa 


ctagggcttt 


gtaatatcaa 


tgtcatgtct 


aataagccta 


2520 


aatttccagt 


tctaaaaaca 


aagttattca 


acaacacttt 


ctcaatttga 


tgtgtttttc 


2580 


ttttcctgaa 


ataatttttt 


ttcttaagct 


tttgatattt 


gcttttgtgc 


tgacatttga 


2640 


tagtcaaaac 


tcaaacgatc 


tcaagtactt 


ctttctcagc 


tcaataggat 


catgtttcgc 


2700 


ttaacatttg 


tgaagttgat 


ttatctaatt 


cattttgatc 


ttgctaaaat 


atgaccttta 


2760 


aaatttttca 


tgtcacattg 


tgacccagaa 


agaccaaata 


catacatgag 


atcauagttu 


2820 


tacctgataa 


tatcactcct 


agtttgatga 


tggaattaat 


ttgtcaatat 


ttcatggagt 


2880 


tctgctgaaa 


cgattgcaat 


ttcttgggca 


agtttcagaa 


cgaccagcat 


atctgtggtc 


2940 


atgtcatctt 


gggtcatgtt 


aaccagttcc 


attggattat 


catgctgttc 


ccttgtctga 


3000 


caaaaggatc 


tttgtgaaag 


cacacactgt 


ttcttcccat 


aactcacatt 


ca 


3052 


<210> 78 

<211> 416 

<212> DNA 

<213> Homo sapien 












<400> 78 
gccgcccggg 


caggtagcct 


ttttctctcc 


agccttgaat 


tgttccctgt 


tggcttccca 


60 


agggcccatc 


tgctggtaca 


gtccacactt 


ccaaagccaa 


gacccgagag 


ggctttcact 


120 


gccccaagcc 


tctctcctgt 


gaccttggga 


ttctgtcttg 


gcagaatcct 


ttgtcagcgg 


180 


ctcttgctct gtccttcctg 


tttggccaca 


gctctttcaa 


tcaatgggta 


ttctagaacc 


240 


gcaggatgtc 


agagctggaa 


gggacgcgat 


accggtttac 


acaaggggaa 


actcctcgag 


300 


gctctgggag ggacggaggg 


ttttggtgac 


agagcgagag 


ctaaaattga 


ggattcctga 


360 


atccagatct 


tgcctcccat 


cagccatctt 


tctcccaata 


aatttatgtt 


atgtgc 


416 


<210> 79 
<211> 1451 
<212> DNA 
<213> Homo sapien 












<400> 79 

taaaaacttt gtcccgatcc 


atccagaaaa 


gagtaggtag 


ctgcatcctg 


acagcctggc 


60 


aaagtcaaga 


aagttgaagg 


agaaacatac 


ctttggagag 


ggggttttct 


ttaaaactag 


120 


tgttaagaaa 


tgcttaggga 


tttttttttc 


ttatttttca 


taactaaagc 


tttcacccag 


180 
agccggctct 


gtttgcactt 


tgctgccgac 


gagtctcatt 


ctgtcaccca 


ggctggagtg 


ctgccctcca 


gggttctggc 


aattccgcct 


tgcgccacca 


cacccggtta 


atttttgtat 


ggcaggatgg 


tcttgaaccc 


ctgacctcag 


ctgggattac 


aagtgtgagc 


catcgcgccc 


gtaaacaaat 


ccttaggtta 


tctttgctgt 


aagtagaaat 


agctaggcat 


tattttttga 


actgaacaga 


gggaagcaaa 


ccttttggaa 


gtttgatgct 


ggaagcatga 


acagcttcat 


tgtcctttca 


gcctaagggg 


gtggaaggga 


attgcgagtg 


tgtggagaca 


aaggcagttc 


gtttcctcgc 


ctgcgatgct 


ccttgtacgt 


gctgggtcaa 


aggtggcctt 


tttctctcca 


gggcccatct 


gctggtacag 


tccacacttc 


ccccaagcct 


ctctcctgtg 


accttgggat 


tcttgctctg 


tccttcctgt 


ttggccacag 


caggatgtca 


gagctggaag 


ggacgcgata 


ctctgggagg 


gacggagggt 


tttggtgaca 


tccagatctt 


gcctcccatc 


agccatcttt 


taaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaatgagcgg 


c 





<210> 80 

<211> 1336 

<212> DNA 

<213> Homo sapien 

<400> 80 

ccgaggtaca aaggctttga ggtccatgga 
ggtgctttga ccctgccata ccctggctat 
tatgaatatg tcagtaataa tccagcacac 
tgtgcaacat cttataaaca atgtcatttc 




attgcaaact 


ttttggcagg 


gtgggagact 


240 


cagtggcccg 


atctcagctt 


actgcaacct 


300 


cagtctcctg 


agtagctggg 


attacaggca 


360 


ttttagtaga 


gaccaggttt 


catcatgttg 


420 


gtgatctgcc 


catctcggcc 


tctcaaagtg 


480 


ggcctgcaaa 


cttttttgta 


ggtatttctg 


540 


ggttgtggtt 


tggctttagt 


catgatttca 


600 


aatatatgac 


ctatatgtag 


tcaagaatcc 


660 


actggctttt 


gggcagacag 


taaacgtcca 


720 


caggtaggta 


ctcctcaact 


ctgatgagtt 


780 


gttgtttgag 


aatagcaaat 


acgcatgttg 


840 


ccaccacagt 


taggtcctgg 


ccgttgtgtt 


900 


cctcaccctc 


ctctcccgcc 


tctgccttct 


960 


gccttgaatt 


gttccctgtt 


ggcttcccaa 


1020 


caaagccaag 


acccgagagg 


gctttcactg 


1080 


tctgtcttgg 


cagaatcctt 


tgtcagcggc 


1140 


ctctttcaat 


caatgggtat 


tctagaaccg 


1200 


ccggtttaca 


caaggggaaa 


ctcctcgagg 


1260 


gagcgagagc 


taaaattgag 


gattcctgaa 


1320 


ctcccaataa 


atttttgttt 


tgtgcaaggc 


1380 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1440 
1451 


ctatacttgt 


ccccatttat 


catcccaggt 


60 


taagataaaa 


agatttgtgg 


acattaaaat 


120 


attgaaatat 


tgacacagat 


taccataatt 


180 


catagtagtc 


taaggcttca 


ccagcctggc 


240 
ccactgtatc tagactttag 
gggaagttga taaatatcac 
caattaaatt ggaaaaggta 
aatctacaaa aggaaattca 
tgcagtatca aatgtccaaa 
aactttgtca aaaatgcctt 
gttaggaaga tgtgcattga 
attaaacatg gacatgtctc 
atagcggttc ccggggggaa 
cccccccacc cccaaccaca 
acaaacacac aacaaaaaaa 
ccacaacaca caacaacccc 
cccgcaccga cccccacaca 
cacacacaca cacactccca 
aacaccaccc caccacacca 
gaccacccac cacacacacc 
ccaccacacc acacccaaca 
catcatcact cccctctaca 
ctccccacac atcacc 

<210> 81 

<211> 1605 

<212> DNA 

<213> Homo sapien 

<400> 81 

atttataagg cccttcaaat 
ggggagaaaa accccaccat 
aagtcaattt gtagtcagtc 
tgaggtttag aattccttca 
gagggggagg atctgaagga 
tacttgtccc atttatcatc 
ataaaaagat ttgtggacat 




gttcatttta ataattatgc tttccttctc tgtatcattt 300 

ttccttagat accttcattc agtgatatat ctggctttta 360 

agtttctctt tggtgggttg agagttggac catcaattct 420 

tgatttcact ctgacgccta ggatctagcc aaggctggtc 480 

ctcatctact attagccata ttttgtgagt cgtttgtcta 540 

tgccatgatt ttgttgctat ctggatttca aacatggaca 600 

agtaggaaaa tttgttcaga ttgctgtatt tattttttac 660 

tcaaaaaaaa aaaaaaaaaa ggctgggggt actcggggcc 720 

atgggtaccc gcccacaatc ccacacaaca accgagcacc 780 

atacatataa cacaaacctt acataaaaaa cactaacaca 840 

acacaccaac acacaaatca ccataacaca acaacataca 900 

gcccaccccc caccactcaa caccaccaca caacacaaca 960 

cccccgcccc acccaccaca cgccccacca caccaccacc 1020 

ccacacacac acaccccccc caacacccac ccacccccac 1080 

cgcacccaca ccccacacac cgccaccacc acacccccac 1140 

cacacacccc cacacccacc cacaccccca cccctcccca 1200 

tctcactcca ctctctcccc ccaccaccca cactcaacca 1260 

cacaacaaca tcaccaacac tccacctgca cacacactca 1320 

1336 



ttgtggcttc 


ctttctcata 


cttctcaagt 


ataatgaaag 


60 


caacacaaaa 


gaaggctata 


aagactgtgc 


accttttaac 


120 


cctgggcctg 


tctttttttt 


tttttaattt 


tgaagctacc 


180 


gccctagctg 


cttttattct 


gctttttatt 


taaacaaaaa 


240 


aactagtttt 


ctgtacaaag 


gctttgaggt 


ccatggacta 


300 


ccaggtggtg 


ctttgaccct 


gccataccct 


ggctattaag 


360 


taaaattatg 


aatatgtcag 


taataatcca 


gcacacattg 


420 
aaatattgac acagattacc ataatttgtg caacatctta taaacaatgt catttccata 480 

gtagtctaag gcttcaccag cctggcccac tgtatctaga ctttaggttc attttaataa 540 

ttatgctttc cttctctgta tcatttggga agttgataaa tatcacttcc ttagatacct 600 

tcattcagtg atatatctgg cttttacaat taaattggaa aaggtaagtt tctctttggt 660 

gggttgagag ttggaccatc aattctaatc tacaaaagga aattcatgat ttcactctga 720 

cgcctaggat ctagccaagg ctggtctgca gtatcagatg tccaaactca tctactatta 780 

gccatatttt gtgagtcgtt tgtctaaact ttgtcaaaaa tgcctttgcc atgattttgt 840 

tgctatctgg atttcaaaca tggacagtta ggaagatgtg cattgaagta ggaaaatttt 900 

gttcagattt gctgttattt attttttaaa ttaaaaatgg aaatgtaaaa aaaaaaaaaa 960 

aaaaaaaaag gctgggggta ctcggggcca tagcggttcc cggggggaaa tgggtacccg 1020 

cccacaatcc cacacaacaa ccgagcaccc ccccccaccc ccaaccacaa tacatataac 1080 

acaaacctta cataaaaaac actaacacaa caaacacaca acaaaaaaaa cacaccaaca 1140 

cacaaatcac cataacacaa caacatacac cacaacacac aacaaccccg cccacccccc 1200 

accactcaac accaccacac aacacaacac ccgcaccgac ccccacacac ccccgcccca 1260 

cccaccacac gccccaccac accaccaccc acacacacac acactcccac cacacacaca 1320 

cacccccccc aacacccacc cacccccaca acaccacccc accacaccac gcacccacac 1380 

cccacacacc gccaccacca cacccccacg accacccacc acacacaccc acacaccccc 1440 

acacccaccc acacccccac ccctccccac caccacacca cacccaacat ctcactccac 1500 

tctctccccc caccacccac actcaaccac atcatcactc ccctctacac acaacaacat 1560 

caccaacact ccacctgcac acacactcac tccccacaca tcacc 1605 

<210> 82 

<211> 952 

<212> DNA 

<213> Homo sapien 

<400> 82 

tatatatagg cgcatgggct aaatgatcat gctcgagcgg cgcagtgtga tggatcgtgg 60 

tcgcggcgag gtaccaggtg aagtgattgg cctgcagtta gggctgtgtt gtgcaaaaat 120 

cacttgtttt ggggtgttag aaccacattt aggcgagaag atcacttttg gggagcttgg 180 

gaactgaggc aggccgcagg tgcagcagag gcatgagctt gcccgggccc gccctcttgt 240 

ctatgctcat ggagtgaagg gggggccagc ggaatggccc aaacaactga tttgtttttc 300 

tttttttaaa tctttttcag acaaatacca ttgtgtttaa gcgaaatgtg tgtataatgc 360 
caaatcactg 


taccccacaa 


ccctgcacac 



ctctacactg 


gactgtaatt 


tcttgttcct 


420 


agtttgtctt 


gctagactgt 


aagctccgtg 


agagcaggga 


ccgtgtctgc 


ttgttgagtg 


480 


ggctttcccc 


tgtgcctgcc 


agcatgcctg 


gcatctagca 


ggtcttctgt 


aaagatggga 


540 


tgagtttgta 


aaccctccag 


cctcaggagt 


ggctccatcc 


tctgaggctc 


tggggccctc 


600 


tggcaggcta gtcatttttc 


tgccatgtac 


gtacaatgct 


ttattttcat 


gttgtatttt 


660 


cctttctagc 


cagtaagcaa 


agctcctact 


ggagcaagtt 


tctttgtggt 


gttatctctg 


720 


tatcctttca 


tcagggccag 


gccagagtaa 


gtaggtgctt 


cgaaggtgtt 


tggtaatgaa 


780 


tgactttggt 


accccgccca 


gtctggctca 


ctgtctgcgg 


ttgtctaatc 


tagatgaagc 


840 


ccgattcgaa ggggggattt 


ccggagtgct 


tggggttgga 


aaaatctctt 


attgccgggt 


900 


actcattggg tcgggcgggg 


gcgagtgggg 


actgctggtc 


ggggatgtcg 


cc 


952 


<210> 83 

<211> 1933 

<212> DNA 

<213> Homo sapien 












<400> 83 
cacccttttt 


gatagcatgt 


tatgaggctc 


accaaggtct 


atcctctggc 


agtatccctg 


60 


gcctgttctc 


cccatctccc 


ctgccctctg 


ctcaccagtt 


ccctaaatgt 


atcttgatct 


120 


cactagccct 


acagactgtg 


cacagagctg 


tttctacctc 


cagtgacatg 


cttccgccca 


180 


ggcttctccc 


cctcccgcct 


caatcttccc 


ctcaccaact 


taaggtctta 


ggggcccttc 


240 


taggaggcag 


tccccaactt 


ccagagccag 


ggtataggct 


acttctatgt 


gtcccatggt 


300 


actgccacag 


aagagtctca 


gctctctcca 


ggattcagtt 


ctaaggtcag 


tgcctaagat 


360 


aaaaatggag 


tgtaattaaa 


attcctctta 


gaaatctaag 


gaaggtgccc 


tattgaagac 


420 


caacatcttg 


aggtcccatg 


tagtcatttc 


ttgcccatgt 


gggaacacat 


tactgtttgg 


480 


ttgagtacca 


ggtgaagtga 


ttggcctgca 


gttagggctg 


tgttgtgcaa 


aaatcacttg 


540 


ttttggggtg 


ttagaaccac 


atttaggcga 


gaagatcact 


tttggggagc 


ttgggaactg 


600 


aggcaggccg 


caggtgcagc 


agaggcatga 


gcttgcccgg 


gcccgccctc 


ttgtctatgc 


660 


tcatggagtg 


aaggaggggc 


cagcggaatg 


gcccaaacaa 


ctgatttgtt 


tttctttttt 


720 


taaatctttt 


tcagacaaat 


accattgtgt 


ttaagcgaaa 


tgtgtgtata 


atgccaaatc 


780 


actgtacccc 


acaaccctgc 


acacctctac 


actggactgt 


aatttcttgt 


tcctagtttg 


840 


tcttgctaga 


ctgtaagctc 


cgtgagagca 


gggaccgtgt 


ctgcttgttg 


agtgggcttt 


900 


cccctgtgcc 


tgccagcatg 


cctggcatct 


agcaggtctt 


ctgtaaagat 


gggatgagtt 


960 
tgtaaaccct 


ccagcctcag 


gagtggctcc 


atcctctgag 


gctctggggc 


cctctggcag 


1020 


gctagtcatt 


tttctgccat 


gtacgtacaa 


tgctttattt 


tcatgttgta 


ttttcctttc 


1080 


tagccagtaa 


gcaagctcct 


aatgagcaag 


tttctttgtg 


gtgttatctc 


tgtatccttt 


1140 


catcaggcca 


gcccagagta 


agtagtgctt 


cgaaagtgtt 


tggtaaatga 


atgaattatg 


1200 


gtaacccagc 


ccagttctgg 


catcactgtc 


tgcaggttag 


tctaattcca 


gatgaaagcc 


1260 


ccagattcga 


aaaagggaag 


attattcaga 


agtgcatatg 


gttgtataag 


aaaaaaaaat 


1320 


ctcattaaat 


agacctgagt 


aaacttcaaa 


ttttgatgat 


ctggccatgt 


cggagctgaa 


1380 


gcttaccagc 


tctctaaagc 


atttgctagg 


gaatcatgca 


gatggtccaa 


aatatttttt 


1440 


agaaaagcat 


tcatctgtcc 


cgggatagaa 


gtagaggaag 


ctggtaggca 


ggcgctagac 


1500 


cctaaagaag 


gactgctgag 


ccagttctta 


agccagtggc 


tcttttgcct 


aaagcagcat 


1560 


tcctcaaact 


tctcactcga 


agcattaagt 


tacggtgaga 


tgctaacaca 


cgctttagac 


1620 


ttaagtcaaa 


atttactgtg 


aagccacagc 


aaaccatggc 


ttcacagatg 


aattagttgc 


1680 


tacacaaata 


agttcgatac 


ataagtacag 


taattgcttt 


taaaccttat 


aataagcagt 


1740 


aaattagaag 


ttaaaccatg 


tttttctttt 


acgaactgaa 


agaagagaat 


gcttttgata 


1800 


ctgagaatcg 


cacactgtgt 


ccaggcagat 


tgttcttgca 


taaccagacc 


cacaataaat 


1860 


atctacaact 


tttcttacat 


gtttataaaa 


taaatcttag 


aaaaaaaaaa 


aaaaaaaaaa 


1920 


aaaaagggcg 


gcc 










1933 


<210> 84 

<211> 376 

<212> DNA 

<213> Homo sapien 












<400> 84 
gcgtggtcgc 


agccgaggta 


ctccactcca 


aatttcccaa 


gaaattcaga 


agaattgtga 


60 


acaagttgct 


ggtttcacaa 


tactgcaagg 


acactgcaag 


gttattccaa 


gttcctcagc 


120 


aagtgtttac 


acattgggcc 


aaggacagat 


ttttcctgga 


gaaggatttt 


accactgcca 


180 


ccatcttgaa 


attcttcatc 


gttttggaac 


acagagccat 


agattttcat 


ttctgcactc 


240 


agctctgttc 


tgagaccggg 


gccatagggg 


ttcttggaga 


gacagggcag 


atggaagaag 


300 


tggaaggcat 


ctgccacact 


gtagagtgcc 


tttaagccac 


ccccattgcc 


tgagttgttt 


360 


tcctttttta 


caaatg 










376 



<210> 85 
<211> 1325 
<212> DNA 
<213> Homo sapien 
<400> 85 



ggccgcactt 


tttttttttt 


tttttttttc 


agagacggag agatttattt taaggaattg 


60 


tgtggctggc 


aagtccgaaa tccatagggc 


aggctggcag gctgaaaagt taagtgaggg 


120 


ttaatgccat 


agaccggagt 


cctgagtttg 


aattccacag ggcaacaggc 


tggaaactca 


180 


gatagggttt 


ctatgttgta gtctcaagga 


gaattccatc ttctttagga 


aacctcaatc 


240 


tttgttctta 


atgccttcaa 


ctgattggac 


agggcccacc cgtagtatgg tgggtaacct 


300 


gctttactga 


tttaagcgtt 


aatctcatct 


aaaaaaatac ctttaccacc 


atgtccagac 


360 


tagtgtttga 


ccaaacaatg 


agttggcata 


ctccacgcaa ggtgacgtgc aaaatgaacc 


420 


atcacagcac 


ttcttgtgtc 


ttcatgtgcc 


cacctggtac atgggggtac 


aactatctgg 


480 


cccaaagggg 


tggtcaatcc 


tgagtggata 


aataatagga atttagtcct 


atgatgtgtc 


540 


ttgatttttc 


ttttcttttc 


ttaccttcct 


atttttatta acccgggtct 


tctgcgccca 


600 


aagactcagc 


ctcattcagg 


accatattat 


gttcatactt ctgctgcgtc 


caaggagtgt 


660 


tgacgaaaaa gtggggcctt ggaggggtag 


aggccaaggg acagtttccc 


ctctgccctt 


720 


tgaagttcac 


gatcttccat 


gcaacaaaat 


tgttttctgt gaaaagcagg 


aaatgaataa 


780 


caacagcgta ggtgcgttgg 


ctatgtccgg 


tggcatttct tcagaatttt 


cattaatgac 


840 


acctgatttt 


ggaggcattg 


tatattttta 


aatacatcca gatgttgttt 


cagttgcttc 


900 


ctctttggtt 


ctttttgctt 


ttcgttgttg 


agcgtcactt aaattcgtgt 


catttcatgt 


960 


tggtacaggt 


actccacttc 


aaatttccca 


agaaattcag aagaattgtg aacaagttgc 


1020 


tggtttcaca 


atactgcaag 


acactgcaag 


ttattccaag ttcctcagca agtgtttaca 


1080 


cattgggcca 


aggacagatt 


tttcctggag 


aaggatttta ccactgccac 


catcttgaaa 


1140 


ttcttcatcg 


ttttggaaca 


cagagccata 


gattttcatt tctgcactca gctctgttct 


1200 


gagaccgggg 


ccataggggt 


tcttggagag 


acagggcaga tggaagaagt ggaaggcatc 


1260 


tgccacactg 


tagagtgcct 


ttaagccacc 


cccattgcct gagttgtttt 


ccttttttac 


1320 


aaatg 










1325 


<210> 86 

<211> 744 

<212> DNA 

<213> Homo sapien 










<400> 86 

gcgtggtcgc ggcgaggtac 


ttttaactta 


aaaaaatgaa catctttgta gagaattttc 


60 


tggggaacat 


ggtgttcaat 


gaacaagcac 


aagcattgga aatgctaaaa 


ttcagttttg 


120 
cctcaagatt 


ggaagtttat 


tttctgactc 


attcatgaaa 


gtcattcctt 


aattggaggc 


180 


ccaccattca 


attattcatc 


tattaattcc 


ttgatccttc 


atttatccat 


tctgcaaact 


240 


tttcttgagc 


accagcacgg 


gtggccattt 


gtggacttct 


cttcattcct 


atgtgttttc 


300 


ttatcaaagt 


gatccactct 


cgaaaggctc 


ctttccagtc 


tgtggttggg 


ttcaagtcat 


360 


gccagggcca 


gggggcccat 


ctcctcgttt 


agctctaggc 


aaaatccagg 


ggatctgcag 


420 


tggggagcgg 


gggcaggaag 


ctggagggaa 


ggcctgtgaa 


gggtagggat 


gtggaaagac 


480 


aaggtgacag 


aaggacccaa 


taggaccttt 


ctatatctct 


ggcttagcat 


tttctacatc 


540 


atattgtaat 


cgtcttattt 


gctagttttc 


ttccttactg 


tgagtgacta 


acagtcatct 


600 


ttatcccaag 


tgcctggtac 


ataataagtg 


atcaataaat 


gttgattgac 


taaatgtaaa 


660 


aaaaaaaaaa 


aaaaaagggc 


tgggggaatc 


agggccaagg 


cttgtccggg 


gtgaaattgt 


720 


ttccccccac 


aaaaaaaaaa 


aaac 








744 


<210> 87 
<211> 1833 
<212> DNA 
<213> Homo sapien 












<400> 87 
tgctccagga 


aagttctgtt 


actccaggga 


ctctctcttt 


tcctgataac 


atggccagca 


60 


agaaagtaat 


tacagtgttt 


ggagcaacag 


gggaagctgg 


tggcagactc 


cgccaagcac 


120 


ctgggtctga 


agcacgtggt 


gtacagcggc 


ctggagaacg 


tcaagcgact 


gacggatggc 


180 


aagctggagg 


tgccgcactt 


tgacagcaag 


ggcgaggtgg 


aggagtactt 


ctggtccatt 


240 


ggcatcccca 


tgaccagtgt 


ccgcgtggcg 


gcctactttg 


aaaactttct 


cgcggcgtgg 


300 


cggcccgtga 


aagcctctga 


tggagattac 


tacaccttgg 


ctgtaccgat 


gggagatgta 


360 


ccaatggatg 


gtatctctgt 


tgctgatatt 


ggagcagccg 


tctctagcat 


ttttaattct 


420 


ccagaggaat 


ttttaggcaa 


ggccgtgggg 


ctcagtgcag 


aagcactaac 


aatacagcaa 


480 


tatgctgatg 


ttttgtccaa 


ggctttgggg 


aaagaagtcc 


gagatgcaaa 


gattaccccg 


540 


gaagctttcg 


agaagctggg 


attccctgca 


gcaaaggaaa 


tagccaatat 


gtgtcgtttc 


600 


tatgaaatga 


agccagaccg 


agatgtcaat 


ctcacccacc 


aactaaatcc 


caaagtcaaa 


660 


agcttcagcc 


agtttatctc 


agagaaccag 


ggagccttca 


agggcatgta 


gaaaatcagc 


720 


tgttcagata 


ggcctctgca 


ccacacagcc 


tctttcctct 


ctgatccttt 


tcctctttac 


780 


ggcacaacat 


tcatgttgac 


agaacatgct 


ggaatgcaat 


tgtttgcaac 


accgaaggat 


840 


ttcctgcggt 


cgcctcttca 


gtaggaagca 


ctgcattggt 


gataggacac 


ggtaatttga 


900 
ttcacattta 


acttgctagt 


tagtgataag 


ggtggtacaa ctgtttggta aaatgagaag 


960 


cctcggaact 


tggagcttct 


ctcctaccac 


taatgggagg gcagattata ctgggatttc 


1020 


tcctgggtga 


gtaatttcaa 


gccctaatgc 


tgaaattccc ctaggcagct ccagttttct 


1080 


caactgcatt 


gcaaaattcc 


cagtgaactt 


ttaagtactt ttaacttaaa aaaatgaaca 


1140 


tctttgtaga 


gaattttctg 


gggaacatgg 


tgttcaatga acaagcacaa gcattggaaa 


1200 


tgctaaaatt 


cagttttgcc 


tcaagattgg 


aagtttattt tctgactcat tcatgaagtc 


1260 


atctattgag 


ccaccattca 


attattcatc 


tattaattcc ttgatccttc atttatccat 


1320 


tctgcaaact 


tttcttgagc 


accagcacgg 


gtggccattt gtggacttct cttcattcct 


1380 


atgtgttttc 


ttatcaaagt 


gatccactct 


cgaaaggctc ctttccagtc tgtggttggg 


1440 


ttcaagtcat 


gccagggcca 


gggggcccat 


ctcctcgttt agctctaggc aaaatccagg 


1500 


ggatctgcag 


tggggagcgg 


gggcaggaag 


ctggagggaa ggcctgtgaa gggtagggat 


1560 


gtggaaagac 


aaggtgacag 


aaggacccaa 


taggaccttt ctatatctct ggcttagcat 


1620 


tttctacatc 


atattgtaat 


cgtcttattt 


gctagttttc ttccttactg tgagtgacta 


1680 


acagtcatct 


ttatcccagt 


gcctggtaca 


taataagtga tcaataaatg ttgattgact 


1740 


aaatgtaaaa 


aaaaaaaaaa 


aaaaagggct 


gggggaatca gggccaaggc ttgtccgggg 


1800 


tgaaattgtt 


tccccccaca 


aaaaaaaaaa 


aac 


1833 



<210> 88 

<211> 251 

<212> DNA 

<213> Homo sapien 

<400> 88 



ttaaaaatgt 


aaaaaaatga 


aaaaaaaagt 


tttgagcatt 


atttgcatca ttgggataca 


60 


tatgtcactt 


cacaagatgt 


tcaatttgaa 


ggaaatacca ctcattctct atgtcctgtt 


120 


gtctgtagtg 


tgcttcagtt 


ttctcatatt 


gagttgacct 


aaatcctgga ttcatgacaa 


180 


gaaaggagta 


agtactacta 


ttcattgttc 


tatttgttta 


taatctgtat tataaaattg 


240 


cacataatta 


a 








251 



<210> 89 

<211> 458 

<212> DNA 

<213> Homo sapien 



<220> 

<221> misc_feature
<222> (327)..(327)



<223> a, c, g or t 



<220>

<221> misc_feature

<222> (435)..(457)

<223> a, c, g or t 



<400> 89 
ttgatattta 


tttagaaaac aagggaaagc ttttaattgt gtgcaatttt 


gtaagacaga 


60 


ttataaacaa 


atagaacaat gaatagtggg acttactcct ttcttgtcat 


gaatccagga 


120 


tttaggtcaa 


ctcaatatga aaaactgaag cacactacag acaacaggac 


atagagaatg 


180 


agtggtattt 


ccttcaaatt gaacatcttg tgaagtgaca tatgtatccc 


aatgatgcaa 


240 


ataatgctcc 


actctgtcgc tcaggctgga gtgcagtggc gcaattagaa 


ctcactgcag 


300 


ccctaaccct 


cttggcttaa acaatcnctc caccttagcc ctctgagtag 


ctagcactac 


360 


aggcacacgc 


caccacaccc agctaacttt tttgaatttt tggagacagg gttcacctta 


420 


ttttttctcg 


tgccnnnnnn nnnnnnnnnn nnnnnnnc




458 



<210> 90 

<211> 251 

<212> DNA 

<213> Homo sapien 

<400> 90 

tggtcgcggc gaggtacaaa aacatggcag taggacaact ttgagctcaa cgcgcgcacc 60 

tccccaggac ctctcacacc cacctgaccc atccaagggc cacaccaccc cgacagatac 120 

tcccaccacc tttagaaaag agtcacccaa tctggagaaa ggtgtggagg ttacatcttt 180 

agaaagaaat cattttaaat acatgaacat tagagaacac agtaaccgtg cttccaccca 240

gcacggggag g 251 



<210> 91 

<211> 2399 

<212> DNA 

<213> Homo sapien 

<400> 91 

ggccgcatct ttttattttt tttttttttc ttacaaatgt gagttttatt tgcattctaa 60 

cccaagcaca gatcccacat acagacttct cgtccgcact atggtacagt aacaagtgca 120 

aaatatgctt tatttcaggt acaaaaacat ggcagtagga caactttgag ctcaacgccc 180 

acctccccag gacctctcac acccacctga cccatccaag ggccacacca ccccgacaga 240 



tactcccacc acctttagaa aagagtcacc caatctggag aaaggtgtgg aggttacatc 



300 
tttagaaaga


aatcatttta 


aatacatgaa 


cattagagaa 


cacagtaacc 


gtgcttccac 


360 


ccagcacggg 


gaggctgcag 


agaggcaccc 


caaacagtcc 


ccttctcttc 


tgtggttaac 


420 


caagcaagcc 


ccgcaaagct 


tttcccagca 


gagcacaccc 


agccagcaat 


gaggcctcag 


480 


gacagagcca 


gcactgtgac 


aaaagaactt 


gtcccaccta 


tctgagagca 


tgagatctgc 


540 


aagtgggatc 


cggactggac 


accaggaagc 


tcaacctggc 


tcgggccatc 


agtagctgtg 


600 


ggaccctggc 


ccccatctaa 


cctcccagtt 


gacttcctga 


aaactgggct 


agtgggggcc 


660 


tccccattgc 


tttagggagg 


atcgagtgag 


aagctgacca 


aatgcgcaga 


acacaaagcc 


720 


tagcctgtta 


gccccagctg 


ctatcactac 


tgttactcac 


gaagaaagga 


actgcagtgg 


780 


ctgctgtctg 


cccctggagt 


gaagacacca 


ggggtgcaca 


gaggctggac 


aagccacagg 


840 


gggctggatg 


gccacagccg 


gacctcggac 


ccatctcccg 


tgggactcct 


tccatttgag 


900 


gctgtccatc 


ctccaatggc 


agatgcagaa 


ttctcaggcc 


caggataccc 


ttatggaggg 


960 


ccggggggcg 


gcccctcaaa 


accaccaagc 


tgaggccact 


gtgagcagtc 


ccctttgtag 


1020 


caggaagagc 


agctggacac 


atgggaagag 


gcttgcaggc 


tccagagaag 


ccaaaaggtc 


1080 


ctcagtcccc 


aaggtactag 


gtggaggtga 


gatcaggctg 


cccctcccag 


gacaagagtg 


1140 


tgagctggcc 


aggaggccac 


tgctgcccac 


cactgctcaa 


agtccttctc 


cacgcatcaa 


1200 


ccctgaggga 


cctggccagg 


ggatgcagac 


caaaggccca 


gcgggtcccc 


aggaacagga 


1260 


agcaggcggc 


aagaaggaaa 


gcaaagggct 


ctgcattccc 


accacgaggc 


ggggtggact 


1320 


ctgcaggagc 


cagctgaagc 


accagaacct 


tccaaggggg 


gctcgccagc 


cacacaggac 


1380 


acgagatgcg 


tgataggcag 


ggttaatgca 


gaaaccgctc 


atctgaatgc 


ttcccctgct 


1440 


tccaagcgca 


aaccaagtca 


ttttattctt 


ttaaaaaagc 


ccttactgtc 


tgaggctttc 


1500 


tttaaataat 


ccaacaggga 


tggaggtaat 


ggaggtctgg 


ctcagtagtc 


agcagacttt 


1560 


ttctgtaacg 


ggctagacgg 


taaatatttt 


gggctttcag 


agccaagagg 


caaaatcaat 


1620 


attatgttgg 


cgtaagagca 


aacaaatttc 


cacaattttt 


aaattgataa 


aatccaaaat 


1680 


acaataattg 


agtacatatt 


tttggtaaat 


acatgtctac 


taaaaataag 


aatttttttt 


1740 


tttgtgatgg 


gggagaaaac 


acttcattgg 


gttcaaagat 


aatattccct 


atcataaatc 


1800 


agtggcataa 


acagccagtg 


ggccaggtgg 


gcccaccgta 


tgctagctga 


caagggcagt 


1860 


cacacaccgg 


taagtgctga 


tacaggatga 


ggggtgcgga 


gctgaggatg 


ttgttctctt 


1920 


tgatgaaggc 


ctgaaaactt 


cccctggccc 


ctgctgccca 


cctcagggct 


cctcaggacg 


1980 


aagggcatga 


ggacaggttc 


ccactctggc 


ccttcgctgg 


gggacatggt 


ttggtttccc 


2040 
catcgccagg ctggccagct


ggggcccaag 



cactgaccgt 


ccttccccac 


cttatcccat 


2100 


cagctgccct gcacaggagc 


agaaaaccca 


tgcctcagag 


ctacttccta 


aggacactgc 


2160 


r , t~r*t - aciaacref cc ctaaacrfcci 


acaaaaccaa 


acjatciQcacrcJ 


tgcagccctg 


gccactagca 


2220 


cagcagttgc cctcagatct 


gttagctgtg 


acccatagat 


tgggggaggg 


aggaaagcca 


2280 


ggaggagctt ttgcagaaag 


gtctggaagc 


taagtggggg 


tttcaggagc 


tctcagggtg 


2340 


cccctgggca ggtcccaagg 


aggcctgggg 


tgcctgacca 


gcacctcact 


ggccctcaa 


2399 


<210> 92 

<211> 595 

<212> DNA 

<213> Homo sapien 












<400> 92 

gtgatcgact cactatagca 


tgttatctag 


atgcatgctt 


cgagcggcgc 


ctagtgatgg 


60 


ataggtcgcg gcgaggtaca 


aatcattaag 


gcaaatctac 


aaaggaccag 


taaggcactg 


120 


gggcacagca gctgactgca 


ggaagtctat 


gtaacttccg 


gtagtaaaac 


agcctttaat 


180 


gcagacatag gtgtgaaaaa 


ttcagtgtta 


tgtttttttt 


tcagtgtcaa 


cctgcaccta 


240 


aagcacaaaa ggtcacttca 


agtcacttca 


atgtaataaa 


caattcccat 


tcatcacatt 


300 


accagtgaaa agagaaataa 


ttcactgggc 


tagaaggaat 


tagaagcagg 


aataaagaaa 


360 






atgtctttaa 


ctctccaagg 


aataccattc 


420 


cgctttgtga aattatacca 


attaattagg 


actcctctat 


attcccatta 


atgtcggatc 


480 


attacataac ttcaatagaa 


taaacacttt 


tttcccactt 


tgctataata 


atagctaggg 


540 


atacctcaac aatataataa 


tggtttaaat 


tatgaccatt 


tctctttggc 


cttgt 


595 


<210> 93 

<211> 1457 

<212> DNA 

<213> Homo sapien 












<400> 93 

cttatcctag agaataactc 


tgtatgaata 


aaattgctta 


attgagtctc 


ttactaaata 


60 


agtaactagt gccatgcttt 


tgtgagctct 


tggtatggcc 


catattactt 


tgtttttttt 


120 


ttttttattg ttgttttgtg 


atagtcttgc 


tctgtcgccc 


aggctgcagt 


gcagtggcac 


180 


aatctcagct cactgcaacc 


tctgcctcct 


gggttcaagc 


aattctcctg 


tctcagcctc 


240 


ctgggtagct gggactacag 


gtgcatgcca 


ccatgcctgg 


ctaacttttg 


tatttttagt 


300 


agagacaggg tttcaccacg 


ttggtcaggc 


tggtctcgaa 


ttcctaacct 


caggtgatcc 


360 


acctgccttg gcctcccaaa 


gtgctgagat 


tacaggcgtg 


agccaccgcg 


cctggcctgt 


420 
ttgttttttt


aacatgattt 


ttctctaagc 


ttaaatacca 


caaggccaaa 


gagaaatggt 


480 


cataatttaa 


accattatta 


tattgttgag 


gtatccctag 


ctattattat 


agcaaagtgg 


540 


gaaaaaagtg 


tttattctat 


tgaagttatg 


taatgatccg 


acattaatgg 


gaatatagag 


600 


gagtcctaat 


taattggtat 


aatttcacaa 


agcggaatgg 


tattccttgg 


agagttaaag 


660 


acattctctt 


tagtaagtgt 


aaaccaaagg 


gcattttctt 


tattcctgct 


tctaattcct 


720 


tctagcccag 


tgaattattt 


ctcttttcac 


tggtaatgtg 


atgaatggga 


attgtttatt 


780 


acattgaagt 


gacttgaagt 


gaccttttgt 


gctttaggtg 


caggttgaca 


ctgaaaaaaa 


840 


aacaaaacac 


tgaatttttc 


acacctatgt 


ctgcattaaa 


ggctgtttta 


ctaccggaag 


900 


ttacatagac 


ttcctgcagt 


cagctgctgt 


gccccagtgc 


cttactggtc 


ctttgtagat 


960 


ttgccttaat 


gatttgtaca 


aatgactggg 


aggcggggat 


gctgcctgtg 


tcctggtgaa 


1020 


ccttaatgaa 


ggggccgtct 


taggcacagt 


gcaaaacaag 


catttgtcct 


gtactgttag 


1080 


agccaaaatt 


gtgatgagca 


atactgataa 


ttgtccagtt 


tatgtcatct 


ttcccagatt 


1140 


ttaaaatctg 


ttctagatat 


tcttagcttg 


aaccactttt 


gattgtgaaa 


tgtattaggt 


1200 


gttgtcccat 


tattactgta 


aaatgaagtt 


ttgaatcttc 


ttgttaataa 


actgtggatt 


1260 


tcccctctca 


atttcttaaa 


caacaacaaa 


aaaatgcttg 


aagattgtct 


ttgagtgtaa 


1320 


gatctgcctt 


ttcagaaagg 


gagtgttagt 


ttgtaatgtt 


aaaaaataaa 


gacctcattc 


1380 


aataaaagtt 


gaagtcatct 


tttaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1440 


aaaaaaaaaa 


actcggc 










1457 


<210> 94 

<211> 936 

<212> DNA 

<213> Homo sapien 












<400> 94 

tatcactata gcgatgtgct 


ctagatcatg 


ctcgagcggc 


gcagtgtgat 


ggattgctgg 


60 


ctgccggcga ggtacacaac 


gccttctaat 


ttgatcttca 


tgactgtaga 


cgggtgcccc 


120 


attccccctg 


aaacaacggt 


tgcgaggttt 


gcgcattact 


gcttcgcctg 


tgcatgacta 


180 


tcttgctgta 


actactagcg 


atgatcttgt 


atcgaattgt 


atctcagcca 


cttatcctct 


240 


cagtatagta 


caattaccat 


tgcatacttg 


tatacctcat 


ggagaaggga 


gttctctcca 


300 


gtcccaaccc 


atgaatgtgt 


tatcgattcg 


cgcattcgtt 


cgctcgcaac 


ctcagataaa


360 


cactcttcat 


actttatcac 


taccttccat 


cttagtcaaa 


gctcaagaca 


aaaattaaac 


420 


aaaatgacac 


ctagccattc 


acctattaca 


accatcgacg 


tataagacac 


accattcgac 


480 






gtgaggaaat 


gacacacgta 


tttccatcaa 


cgagagtgct 


ggatgggagg 


gcaggacaca 


540 


gactggaagt 


cagcttcagc 


gtatacccac 


ttactcacaa 


gggctgacca 


tgatggccct 


600 


gctcttgcat 


gaggactcac 


atggacagcc 


agtggcactg 


atgcctgcag 


gactgcatgt 


660 


gccgacaaga 


tgatgcacat 


tttaatattc 


aaaatgtaaa 


caaagtatcc 


acaaatagtc


720


caaaaaagcc 


aaagaatggc 


tttacttaac 


aaatttttaa 


attgccctta 


aaagtgactt 


780 


tttggcataa 


tgcctttctt 


aaaaaaagaa 


ctctcacaaa 


aaaccatttt 


ttttaaccca 


840 


aaaacaatgc 


ctagatcact 


tcagaactga 


aatcctcaga 


tgaaacaggt 


gaacacttct 


900 


gtgactgcta 


cctctcagaa 


ggcttgggcg 


tatcat 






936 


<210> 95 

<211> 480 

<212> DNA 

<213> Homo sapien 












<400> 95 

tctcgagccg cgccagagtg 


atggatgcgg 


ccgccgggca 


ggtacttatt 


aagcagagca 


60 


ctttgtaaga 


ttcagaactg 


actcctgatt 


atcttaagcc 


attaaaagac 


tcaagttttg 


120 


catggacttt 


ttgtcctctt 


tggcccctga 


gtgtgcccca 


tctctgccca 


gcactaataa 


180 


cacgttggaa 


gagcaaagga 


tttcccacca 


gcgtgccaga 


tgatttatat 


aggcagggac 


240 


cttcattgaa 


aagaagggaa 


gggacactgt 


cttaaatgca 


catttaaggt 


tctttctgat 


300 


tattggcaat 


ctctatgaac 


caacacttaa 


tccatgggct 


aacagagaga 


tttttttttt 


360 


tttaatgtga 


agaggattaa 


agaataaaga 


aaaaacaaaa 


aagtctttat 


tctaaaataa 


420 


gaaatcagcc 


ccatctgtgg 


cacagttctc 


atgcagaata 


ttgcacccag 


tgtgaactag 


480 


<210> 96 

<211> 1111 

<212> DNA 

<213> Homo sapien 












<400> 96 

cgtccaatca gaggctctgt 


atttataaat 


aacccgtgtg 


ggattatgct 


ctccagtgaa 


60 


tataactgga 


gcctaaattc 


acaaccttaa 


gatctgacag 


ccagatgtgg 


aaggtctaca 


120 


aagagctctg ctaataagta 


atatgtttgc 


aaagtgctct 


gctaagtaag 


gtacttatta 


180 


agcagagcac 


tttgtaagat 


tcagaactga 


ctcctgatta 


tcttaagcca 


ttaaaagact 


240 


caagttttgc 


atggactttt 


tgtcctcttt 


ggcccctgag 


tgtgccccat 


ctctgcccag 


300 


cactaataac 


acgttggaag 


agcaaaggat 


ttcccaccag 


cgtgccagat 


gatttatata 


360 
ggcagggacc


ttcattgaaa 


agaagggaag 



ggacactgtc ttaaatgcac atttaaggtt 


420 


ctttctgatt 


attggcaatc 


tctatgaacc 


aacacttaat ccatgggcta acagagagat 


480


tttttttttt 


taatgtgaag 


aggattaaag 


aataaagaaa aaacaaaaaa gtcttatact 


540 


aaaataagaa 


atcagcccca 


tcttggcaca 


gttctcatgc agaatattgc acccagtgtg 


600 


aactaacgct 


agaagcttca 


aactgtataa 


atttaaatgt atttgcatat tataaaaata 


660 


aagataaaca 


tatacatatt 


ttacactagt 


tatggaacag caatgaacgt cagtcgatcc 


720 


ctctttcaca 


tttaacagaa 


ctgaaatctg 


agtgctctaa atactgccac ctgtactgta 


780 


actatggctt 


atatgtgcac 


ggaaaacaaa 


atccctgaga agccattcga cttttttttt 


840 


ttttttttct 


tttcttcaag 


tagcgcgctc 


cttggaggat cacagttctg aggttcaggt 


900 


tgtaaaacat 


ttgctccatg 


ttctcgtcca 


tgcttccccc caccaccccc tccccacctc 


960 


ttccccagtc 


atccaaaaag 


caccctgcaa 


gcacgcgttg tcactcaagt tcacagaaca 


1020 


cgctggggtg 


agtgcagagg 


gtctgccagg 


tgcaaaagat ggtccaggtg ttcagatgct 


1080 


ctcttttctc 


catggaaatt 


ccacagccac 


a 


1111 


<210> 97 

<211> 395 

<212> DNA 

<213> Homo sapien 








<400> 97 
gcgtggtcgc 


ggcgaggtac 


caccagctac 


ttcagagtga agaaagaaag tgtctttcgc 


60 


cactggttct 


taaggagtgg 


atctccggtc 


cacatctccc aatgctgctt agcccaggaa 


120 


atgaagacag 


ctgacaacac 


agaggaacat 


taacgaaaag aacaaacccc aaacttgaaa 


180 


tgcataactg ggactgctgg 


aatgggccaa 


ggcataccac ggcgggtcac tgtcaccagg 


240 


agggcgcctg 


tgttttagaa 


gggagtggcc 


aacaccgtct ggccaacctg gaagggagcc 


300 


aacgtgacag 


ctaagaacac 


tagacaagca 


tcctagcctt ctcacccatc gatctcagtg 


360 


tgatcttgcc 


acagagccac 


tggcaagcca 


ttcgg 


395 


<210> 98 
<211> 3813 
<212> DNA 
<213> Homo sapien 








<400> 98 
tttttttttt 


gcatacagaa 


ttattttatt 


taacttaaac catgtagtac tttactagaa 


60 


aaaagcagag 


taagagaaac 


taacgttgcc 


ttagcttcag ccattcaaaa tagacagttt 


120 


cttttttcca 


ttatgtaaag 


aatccagagt 


atatcgcaat aacaggaata aattcttaca 


180 
acagaatata


caaaaacatt 


ttgaaatttt 


tttcatctac 


tgatttttta 


tataaacagg 


240 


attttttagg 


aataatttat 


acacagaaag 


tcattttatg 


taacaaattg 


gccatgttat 


300 


tacctttttt 


tttcttactt 


aaaaaaaatt 


tttttttaac 


aagaaaactc 


agaaaatgca 


360 


ttatttgcgg 


tgcatccatt 


ccatcccgcc 


ttctggtttg 


atttttttta 


tcccagacaa 


420 


agggataccc 


agaggtagac 


aaactctggc 


aaacctctca 


cctcaacctc 


actggcttag 


480 


aaagcagaca 


ggtgttttca 


ccgggcgtct 


cttccaccgg 


tggatgtgtg 


tgtgcaacgc 


540 


caaacatcca 


aatgaaaagt 


tgaaaacaaa 


acccaaatag 


tttccagatc 


tttctcctat 


600 


gtagcgcaaa 


aaccccaggt 


gtttcccatg 


atagagatat 


tgtggagttg 


gggagacatg 


660 


gccaacaatc 


cactggagtt 


agaggaagtg 


gagtcaactt 


taaaaccaat 


ttttgtgtat 


720 


gttatagttt 


ttattaaaaa 


ttcttttctg 


tattggaaat 


acgtatcttc 


ggggttctgc 


780 


cttaaataaa 


atagggtctc 


ttctgtgttc 


taacaaagca 


gtaatcaatc 


atgatacgag 


840 


gcatgtaagc 


aaatggcctg 


tgtgccccat 


agctgtcatc 


gtttatgtga 


tcactgtgta 


900 


tgctgtaatc 


atcatgaaaa 


ccattggctg 


tgcagtcaag 


tcttcccatg 


tccatggcac 


960 


tgtgctgaca 


atgacggcag 


ccagaccaat 


caagccagat 


gccgaacggt 


gccatgcact 


1020 


gctgggctcc 


aacacctcgg 


ctgtttccat 


ggccaatgag 


gttctttcca 


tctgtaaagt 


1080 


agctacactc 


caaagggaaa 


tgaccagtga 


aaaatgatcc 


ccgctcccaa 


ccagagggtt 


1140 


ctgggaagac 


ccagatactg 


aagatacaga 


actacttgtt 


aaatcatcat 


cattcctttg 


1200 


ttataaacta 


gggccaaaaa 


gcatgggggg 


aaaagagggt 


gaaccacgga 


tggggtcagg 


1260 


gagaaatcct 


tttttggcat 


attgtaaagc 


tagctagtaa 


acaggtgtcc 


taaaaaaaaa 


1320 


aaaaaaatgg 


atcacaaaca 


cacatttgca 


cacacagaga 


aaaaaaaagt 


gtctttaaat 


1380 


tattttcggc 


aattataaac 


tacaaacatt 


ttataaaatt 


attctgttct 


tacaaaaatg 


1440 


caatgaaaca 


tttaattagt 


tcttgtaaac 


cagatcttcg 


tcaactgact 


acccgtaatc 


1500 


tactggactt 


aagagcccta 


ttgaaaacgc 


taatgagtga 


ctagttaaaa 


caatgttgac 


1560 


cagaaagatg 


cggacacaca 


cacgtgggac 


ataaaaatga 


gtgatcgtta 


tgttctcgca 


1620 


gaaacaaaac 


gctaaagtca 


tgccccagaa 


ctgacaactg 


acgagtgcga 


tgtagtttgt 


1680 


gttcttgaac 


aattcgcctt 


aactccgaga 


tcagtttcca 


gagtaggatg 


ttaggtttct 


1740 


ttgttgaaac 


caaatgccca 


cggatgactg 


ctgagcctcg 


atgtggtctg 


atagcttcaa 


1800 


gaacaatcaa 


gttcaaaacc 


ttgcccacat 


tccttccaag 


ccctgcaccc 


tgctgctcgt 


1860 


aactcttctc 


tgtgccagga 


tccctgtgac 


aatggttctg 


tacccaccac 


ccggtcattc 


1920 
caccatcaag


gagcactcac 


ttctccaaat 


gacgcaacca 


gtcatctaag 


atgaaggcca 


1980 


aaaatgaaac 


cctctgacct 


gtccaaggtc 


tgtatcttca 


gcatctctgg 


caacagaggt 


2040 


tcagtctaaa 


taacatcctc 


tcttgaagtt 


ctcatttgta 


ccaccagcta 


cttcagagtg 


2100 


aagaaagaaa 


gtgtctttcg 


ccactggttc 


ttaaggagtg 


gatctctggt 


ccacatctcc 


2160 


caatgctgct 


tagcccagga 


aatgaagaca 


gctgacaaca 


cagaggaaca 


ttaacgaaaa 


2220 


gaacaaaccc 


caaacttgaa 


atgcataact 


gggactgctg 


gaatgggcca 


aggcatacca 


2280 


cggcaggtca 


ctgtcaccag 


gagggcgcct 


gtgttttaga 


agggagtggc 


caacaccgtc 


2340 


tggccaccct 


ggaaggagcc 


aacgtgacag 


ctaagaacac 


tagacaagca 


tcctagcctt 


2400 


ctcacccatc 


gatctcagtg 


tgatcttgcc 


acagagccac 


tggcaagcca 


ttcggctgcc 


2460 


agcctgcaga 


ggtggactgc 


tgtgggaaaa 


gcttcagtgg 


aacaaactgg 


cctagaatct 


2520 


ggtgccagag 


aatagttggc 


tcataaatct 


tttatcccaa 


agcaccccaa 


tgtgcaccga 


2580 


gcatctcgcc 


atttggtagt 


tccacagtga 


ctgctcttct 


attttacgaa 


gccacttttc 


2640 


gtaccattga 


aatgtgactc 


cgcatcttaa 


gaaacaggcc 


aggagaggcc 


aggtcggcca 


2700 


cgaacacggt 


cttccatctt 


ccaaggagga 


ggctcgttcc 


aaaggtatgt 


caaacgtcat 


2760 


gacatcccac 


gaaggagacc 


tggggtggtc 


tgggtcttga 


tcaagttcaa 


cagtgtctcc 


2820 


agtacacacc 


agctcctgct 


gcctcaggga 


agggctgtgt 


gggagaacgt 


cttaatgctc 


2880 


acatcaattc 


catgatgtgc 


ctcagtggtt 


catatgactt 


caagcagtgt 


gagctgctgg 


2940 


cagaaggagg 


ttgcatggaa 


aatgccacaa 


ctcagcttaa 


gattatttgg 


aacacacaga 


3000 


gaaaaatcaa 


gaggcagaga 


aatcagcatg 


aagttaggct 


agaactgtgg 


atagatgatg 


3060 


ccttggtata 


gtcaaggtgg 


gtgggttggt 


ccagtaaaat 


gactgcacca 


tcacacaagc 


3120 


caagttttct 


cttttattga 


ggtgtttctg 


aagcatttgc 


tatgatgggc 


actacactgt 


3180 


tagtgttttc 


tacttaaaag 


ctacctacat 


aaaacaaaaa 


caaaatgaca 


tcagtaactc 


3240 


ggacagttta 


aacaaaaccc 


ttataaccag 


ctgaatgatc 


tgtagaaatc 


cagttgtgga 


3300 


aaccggtagg 


tatgtgcacg 


agaggagagg 


ccattattcc 


aaggggtcac 


ctacagtgtg 


3360 


gaagaacaaa 


accctgatcc 


aacagtattt 


ctcaatgacc 


ttaagatccc 


cggggtcctg 


3420 


ggagagatgg 


gtcgcgttcc 


agcaggaggc 


tcctgagcgc 



CttCaCLLdy 


gctggaattg 


3480


ccgaggagcc 


atccacgctt 


aggcaaagct 


gcccgccaca 


cttcccagcg 


aaatcatcag 


3540 


tgcctgtaag 


agcagctgct 


ggaagagggg 


agcgagccgt 


ctgtcctgtg 


agcagcagga 


3600 


ccagggagac 


ggaggtacag 


ccctggcggg 


gggggcgtga 


cgccacgagc 


ctcggcgccg 


3660 


gccactgggc 


tttccagccg 


gaaggtttgg 


tacatctctg 


tctaccccta 


gggaaggagg 


3720 
tgagggagaa gttgagaggg


cagcctggag 


ggggagggga ctctgctcct gcgtgtgttc 


3780 


tatcagggac 


tggctgtgag 


gtccggagct 


ctg 


3813 


<210> 99 

<211> 960 

<212> DNA 

<213> Homo sapien 








<220> 

<221> misc_feature
<222> (716)..(716)
<223> a, c, g or t 








<400> 99 
atgctcaaac


cggcgcatat 


gtgatggatc 


gcccgggcag gtaccttggg gactgaggac 


60 


uiui uy yv-ui. 


rtctaaaacc 



tgcaagcctc 


ttcccatgtg tccagctgct cttcctgcta 


120 


^Actciy yy y ciw 


tachcacaat 


ggcctcagct 


tggtggtttt gaggggccgc cccccggccc 


180 


tccataaggg 


tatcctgggc 


ctgagaattc 


tgcatctgcc attggaggat ggacagcctc 


240 


aaatggaagg 


agtcccacgg 


gagatgggtc 


cgaggtccgg ctgtggccat ccagccccct 


300 


gtggcttgtc 


cagcctctgt 


gcacccctgg 


tgtcttcact ccaggggcag acagcagcca 


360 


ctgcagttcc 


tttcttcgtg 


agtaacagta 


gtgatagcag ctggggctaa caggctaggc 


420 


tttgtgttct 


gcgcatttgg 


tcagcttctc 


actcgatcct ccctaaagca atggggaggc 


480 


ccccactagc 


ccagttttca 


ggaagtcaac 


tgggaggtta gatgggggcc agggtcccac 


540 


atgctgctga 


tggcccgagc 


caggttgagc 


ttcctggtgt ccagtccgga tccccacttg 


600 


cagatctcat 


gctctcagat 


aggtgggaca 


agttcttttg tcacagtgct ggcctctgtc 


660 


ctgaggcctc 


attgctggct 


gggtgtgctc 


atgcgtggga aaaagcttgg gccgtnaact 


720 


atggccatac 


ctgttccctg 


tgtggacatt 


gttctcccgc tcccatcccc atcccgccgc 


780 


acccaaccca 


ccccaacacc 


cacaccccac 


gcccccaaac gcccacaccc acaacaccac 


840 


acccccccac 


ccccaccccc 


ccacacccac 


cccaacaccc acaccctcgc ccccccccca 


900 


cctccaccca 


cccaccaacc 


cacaacacac 


cacacaccac ccaccaccag caccaccacc 


960 


<210> 100 
<211> 2754 
<212> DNA 
<213> Homo sapien 








<400> 100 

gccagaagca gcctcagctt 


ggcaaggtgt 


ggagatgact gctgttccct tcgcatttgg 


60 
ggaaaacagg


ctccctcggt 


agctcgatga 


tcctcttttg 


atcttgtgtg 


acctcctgga 


120 


gagtggatga


cgctggtggc 


cttagctttt 


ctagacagtg 


taaattgcac 


tgggcgatgt 


180 


ccccagagca 


gggcaaggtc 


tctagagcgg 


gtctcccaca 


tgactggctt 


cacacaggca 


240 


cttccgctcg 


ggttgcatgc


tctgtgtcat 


cttaccggtc 


cagggttgca 


ggtaggaaat 


300 


gtttgtaccc 


tcttctgatt 


gccacctcct 


tcccatcgcc 


ccttagggac 


agggcttgag 


360 


ggccagtgag


gcgctggtca


ggcaccccag 


gcctccttgg 


gacctgccca 


ggggcaccct 


420 


gagagctcct 


gaaaccccca 


cttagcttcc 


agacctttct 


gcaaaagctc 


ctcctggctt 


480 


tcctccctcc 


cccaatctat 


gggtcacagc 


taacagatct 


gagggcaact 


gctgtgctag 


540 


tggccagggc


tgcacctgcc 


atccccggct 


ctgccacttt 


agggccttct 


agaggcagtg 


600 


tccttaggaa 


gtagctctga 


ggcatgggtt 


ttctgctcct 


gtgcagggca 


gctgatggga 


660 


taaggtgggg


aaggacggtc 


agtgcttggg 


ccccagctgg 


ccagcctggc 


gatggggaaa 


720 


ccaaaccatg 


tcccccagcg 


aagggccaga 


gtgggaacct


gtcctcatgc 


ccttcgtcct 


780 


aaggagccct


gaggtgggca


gcaggggcca


ggggaagttt


tcaggccttc 


atcaaagaga 


840 


acaacatcct 


cagctccgca 


cccctcatcc 


tgtatcagca 


cttaccggtg 


tgtgactgcc 


900 


cttgtcagct 


agcatacggt 


gggcccacct 


ggcccactgg 


ctgtttatgc 


cactgattta 


960 


tgatagggaa 


tattatcttt 


gaacccaatg 


aagtgttttc 


tcccccatca 


caaaaaaaaa 


1020 


aattcttatt 


tttagtagac 


atgtatttac 


caaaaatatg 


tactcaatta 


ttgtattttg 


1080 


gattttatca 


atttaaaaat 


tgtggaaatt 


tgtttgctct 


tacgccaaca 


taatattgat 


1140 


tttgcctctt 


ggctctgaaa


gcccaaaata 


tttaccgtct 


agcccgttac 


agaaaaagtc 


1200 


tgctgactac 


tgagccagac 


ctccattacc 


tccatccctg 


ttggattatt 


taaagaaagc 


1260 


ctcagacagt 


aagggctttt 


ttaaaagaat 


aaaatgactt 


ggtttgcgct 


tggaagcagg 


1320 


ggaagcattc 


agatgagcgg 


tttctgcatt 


aaccctgcct 


atcacgcatc 


tcgtgtcctg 


1380 


tgtggctggc


gagcccccct 


tggaaggttc


tggtgcttca 


gctggctcct 


gcagagtcca 


1440 


ccccgcctcg 


tggtgggaat


gcagagccct 


ttgctttcct 


tcttgccgcc 


tgcttcctgt 


1500 


fcccfccrqqcrac 


ccgctgggcc


tttggtctgc 


atcccctggc 


caggtccctc 


agggttgatg 


1560 


cgtggagaag 


gactttgagc 


agtggtgggc 


agcagtggcc 


tcctggccag 


ctcacactct 


1620 


tgtcctggga


ggggcagcct 


gatctcacct 


ccacctagta 


ccttggggac 


tgaggacctt 


1680 


ttggcttctc 


tggagcctgc 


aagcctcttc 


ccatgtgtcc 


agctgctctt 


cctgctacaa 


1740 


aggggactgc 


tcacagtggc 


ctcagcttgg 


tggttttgag 


gggccgcccc 


ccggccctcc 


1800 


ataagggtat 


cctgggcctg 


agaattctgc 


atctgccatt 


ggaggatgga 


cagcctcaaa 


1860 
tggaaggagt


cccacgggag 


atgggtccga 


ggtccggctg 


tggccatcca 


gccccctgtg 


1920 


gcttgtccag 


cctctgtgca 


cccctggtgt 


cttcactcca 


ggggcagaca 


gcagccactg 


1980 


cagttccttt 


cttcgtgagt 


aacagtagtg 


atagcagctg 


gggctaacag 


gctaggcttt 


2040 


gtgttctgcg 


catttggtca 


gcttctcact 


cgatcctccc 


taaagcaatg 


gggaggcccc 


2100 


cactagccca 


gttttcagga 


agtcaactgg 


gaggttagat 


gggggccagg 


gtcccacagc 


2160 


tactgatggc 


ccgagccagg 


ttgagcttcc 


tggtgtccag 


tccggatccc 


acttgcagat 


2220 


ctcatgctct 


cagataggtg 


ggacaagttc 


ttttgtcaca 


gtgctggctc 


tgtcctgagg 


2280 


cctcattgct 


ggctgggtgt 


gctctgctgg 


gaaaagcttt 


gcggggcttg 


cttggttaac 


2340 


cacagaagag 


aaggggactg 


tttggggtgc 


ctctctgcag 


cctccccgtg 


ctgggtggaa 


2400 


gcacggttac 


tgtgttctct 


aatgttcatg 


tatttaaaat 


gatttctttc 


taaagatgta 


2460 


acctccacac 


ctttctccag 


attgggtgac 


tcttttctaa 


aggtggtggg 


agtatctgtc 


2520 


ggggtggtgt 


ggcccttgga 


tgggtcaggt 


gggtgtgaga 


ggtcctgggg 


aggtgggcgt 


2580 


tgagctcaaa 


gttgtcctac 


tgccatgttt 


ttgtacctga 


aataaagcat 


attttgcact 


2640 


tgttactgta 


ccatagtgcg 


gacgagaagt 


ctgtatgtgg 


gatctgtgct 


tgggttagaa 


2700 


tgcaaataaa 


actcacattt 


gtaagaaaaa 


aaaaaaaaat 


aaaaagatgc 


ggcc 


2754 


<210> 101 
<211> 301 
<212> DNA 
<213> Homo sapien 












<400> 101 
cagcggggga 


atgtatagac 


aatatggccc 


atggtgcact 


aatgctgcga 


gcggccgccg 


60 


agatgtgatg 


gatggtcgcg 


gccgaggtac 


atttaatcca 


tcttctccat 


ttcctccttc 


120 


aggagccagc 


tatgagattt 


cagtgcattt 


ctagcccagc 


ggatgttcat 


tctccactaa 


180 


acttcatctt 


ttactaagca 


aagggggatt 


attccatgag 


gcagccagga 


gcaaggggcc 


240 


atgatattca tgactttgtc tgctgggcat ttttcaaagt gtccttgaat tcactcagca 300
a 301


<210> 102 
<211> 4318 
<212> DNA 
<213> Homo sapien 












<400> 102 
gttgtatatg 


cttttttttt 


ttccaaataa 


acttgtcacc 


ctgcatgccc 


ttggcaaata 


60 
agtgaagcag


aaataggaac 


acagtccaca 


ttcaagttga 


ggaacagtgt 


atctttaaga 


120 


gctgaccttt 


gggtgacctg 


gaaaggggga 


aagatggcta 


agcatggaga 


gaaacgaggc 


180 


aagagacaag 


ctatgataca 


acaccgcttc 


agcccctgcc 


ctcaatagca 


cacaacccac 


240 


atatcagctt 


tctctagaga 


aggaacctac 


tgtttagtgc 


tcctcacttt 


gcaatgtttg 


300 


tgctacgcca 


gaatttctcc 


agtttttttc 


attatcatcc 


ccctgagaaa 


aaaattacat 


360 


tgaatttaaa 


ttttccctaa 


taagagaaat 


taaatatgaa 


agaataggat 


tttgttgggt 


420 


aagattgagc 


tttggaaggt 


cacgaaccat 


tattctatct 


aaggtgtgtg 


ttttgttttg 


480 


tttttttttg 


tttttttttg 


tttttttttt 


tgcaatccct 


ctccccctga 


accaatttca 


540 


cattgggaat 


gcaggaccta 


gactgctgaa 


taaaaagcta 


cttcttctat 


aattgtcagg 


600 


ttctctccaa 


tttctcagct 


tcctcaaaga 


catggggtgg 


agttgggtgt 


gccatcactg 


660 


agagagctct 


ggaacttttg 


actcttagtg 


acatttttag 


atttaggggt 


tcatggcctt 


720 


ccacatgttg 


ccaccaactg 


gtgatctctg 


ccccttcatg 


ctgatcaaga 


aagtagaaac 


780 


tcctcgtcgc 


cttcaggttt 


gcagtcgcag 


aaacattgcc 


tgctgtggac 


tgtcagcaca 


840 


aaactgggac 


actggtgtca 


tttagactgt 


cagcagtgca 


catgattgta 


cgatagactc 


900 


caggcaacca 


tgtgcatctg 


tgcaagatga 


cctctgcccc 


aagagaaggg 


ttacggtcta 


960 


attaaatgtt 


taccaagatg 


gtaccagtgg 


gctctccccg 


tgtcctttgg 


tgttattgga 


1020 


gctggtgtat


gacagactca 


aggaaatttt 


ttaaggaaaa 


tgaagaagaa 


atcaaccttt 


1080 


atggttctct 


ttcattggaa 


gaggagaata 


aggaaagaaa 


tggcaggtag 


agagggaggg 


1140 


ggaaaggaat 


agaaatggca 


tgtctttgat 


gctgtggctt 


gtgtggggac 


aatgggaaga 


1200 


gcacagcagg 


cacaacatat 


ctgtgttagt 


gccacgtggt 


atctgttaag 


tatggccaga 


1260 


gcctcacata 


taagtgaaga 


gagttaagac 


aattcttgct 


ctttgaacaa 


taatagtcta 


1320 


tagaatttct 


attggcaaac 


catcccagac 


aacctgacct 


atcaaacaaa 


agcaaataat 


1380 


ccctgtctca 


gaactgctgg 


tctaaaagct 


agaaggggca 


tgataatgga 


aattgctaga 


1440 


aaagagagaa 


tgctctactc 


tctttcctgt 


gcctacctac 


ccccatccta 


aaccctgtaa 


1500 


aacagaattt 


caaaatagat 


gtcaaatatg 


aagtaattca 


gacttccaaa 


gaaggaaaga 


1560 


gttctgccca 


gggcagtatg 


agcaaatcca 


cagggatgtt 


aagatttggt 


ccaactcaaa 


1620 


ggtttatggg 


cagtgagcct 


agagtctctt 


gagagtaaac 


ccttgcattt 


gggacaagga 


1680 


gaatatgtga 


agttcaggag 


tgctcacact 


agagcaagat 


ccagaaaaaa 


aaaaatccaa 


1740 


tggcatttta 


aactagattg 


cattatcact 


caatgctgtt 


actttgagca 


gacaaatcag 


1800 


ttgaatggga 


gggcaaatgg 


cagaaatgaa 


caaaagctat 


taggtgaaag 


catccccaat 


1860 
gtatcagttg


tgagatgatt 


tttgtttaat 


gatgatcagg 


tttacattga 


agtggcttgg 


1920 


aagacgtatt 


tcagagggac 


tgggtttgta 


ctgcaaaact 


ctgaacacta 


gagcaggttc 


1980 


tactagcact 


tgggcagtaa 


ggtagcgggt 


gtgtattatg 


ctattgccat 


agttctcgtc 


2040 


tttgtggatt 


cacataccct 


ttcccatgag 


gagcttgcta 


ctaacagcct 


cctgtttctg 


2100 


ttgtttttat 


atgggagaag 


agaaagagct 


tggaatttca 


attgtctaaa 


caattggtat 


2160 


gatttacaag 


aaggcaaacc 


attcaggaga 


tgttcaacag 


tgtatcattc 


ttagcattca 


2220 


atacaatgtt 


tattataaaa 


tatacacctg 


agtttatgtt 


tttctgccag 


gctgaagggc 


2280 


aataatgtct 


tgctatgcaa 


acactctatt 


ttatgtgtga 


atttttttaa 


ctgtaatttt 


2340 


atgtgtgaat 


ttttttaact 


aaactctatc 


atcattttct 


atgttgacat 


cttttttttt 


2400 


ttttaatctg 


tttccaactc 


tgagtctgtg 


aactatctct 


tctgactgga 


tgctggccta 


2460 


aaatcctatt 


agtgcttaaa 


cagacctcaa 


aacactctga 


acctgtaggc 


caatacagag 


2520 


atgttgttct 


ttgatatatc 


ttgggtcctt 


gatggcttca 


acaaacaagg 


tcataatttg 


2580 


tttaaattca 


gtgttttctg 


tagatacctt 


cttctcagtt 


tcattaagaa 


gagtaaccat 


2640 


ctttggtttt 


ccaaagaatg 


gaaactccta 


ccctagactt 


tagacttaga 


gcgtctgcct 


2700 


ttcacgatga 


gtaagagatc 


cagaatgttt 


aggggatgtg 


cgtcagccca 


cctggtacag 


2760 


tggcaaagct 


gaatgcgaac 


ttgaggcctc 


ctgaggcctg 


accttctatg 


cccccagccc 


2820 


tccaccccaa 


gcaccttttg 


gtgtggctgg 


aaacacatga 


cattggatgt 


gatttttctc 


2880 


aagcccttca 


ctgtggaagg 


catgggagac 


tgcccagcat 


tgggcatgtg 


gctgttaacg 


2940 


tttccatttc 


aagtccctga 


ttcttactgg 


agaagttaag 


gagccactta 


atgttttcac 


3000 


agacctctga 


ttctgttgtg 


aatgtggcat 


tccagtgagt 


acaacctgct 


tgctataaag 


3060 


aaagcagcat 


attttgacaa 


ttttatttct 


tccttgggta 


cttacatttt 


tattacaaaa 


3120 


atggccgtta 


taaaaaaaga 


cagaagggct 


gggcagtggc 


ttgctcccta 


gaaactgaga 


3180 


ttccgaagca 


ggtgtttcct 


ctcccctaga 


ctcagaggta 


catttaatcc 


atcttctcca 


3240 


tttcctcctt 


caggaccagc 


tatgagattc 


agtgcatttc 


tagcccagcg 


gatgttcatt 


3300 


ctccactaaa 


cttcatcttt 


tactaagcaa 


agggggatta 


ttccatgagg 


cagccaggag 


3360 


caaggggcca 


tgatattcat 


gactttgtct 


gctgggcatt 


tttcaaagtg 


tccttgaatt 


3420 


cattcagcaa 


acaaagcttt 


ctggagaatc 


tctcaaaact 


taggccctgc 


tccatttggc 


3480 


caaaaatgat 


ggctgctcca 


aaactctgaa 


cttcttaaaa 


ctccatctgc 


tacattattt 


3540 


ctggagttta 


aacatgactt 


tttctgtctt 


tgagtataga 


tgtgtttgtt 


taattaacga 


3600 
agcacaagtc tgttaagcag


aaggctccaa 



gctgtattct 


atacttggga 


atccttggtg 


3660 


ccatctttat tctaccaagt 


gccaatcacc 


atggctaaag 


tgggcgtata 


ttacagcctg 


3720 


tgtcctaagc ttagaagctt 


taatgtactt 


ttttaaatga 


aaagtattag 


agggggttga 


3780 


acattgtaac taaagcataa 


agttagacca 


attacatgca 


gagatgttta 


tttaatattg 


3840 


tgtgagctga gtccttctgt 


ataaattatt 


tgcacacttt 


tcttgcatga 


tgaactgatt 


3900 


ttttatagtt gtttgtacca 


gacggtggca 


tatttttgta 


aaaaactttt 


gacactgaat 


3960 


tgcaataaaa tgtttttcca 


acaaaaaaaa 


aaaaaaaaaa 


tgcggcagag 


gctgcggggc 


4020 


gacgcgcggg ccggcgcagc 


catggtgaag 


attagcttcc 


agcccgccgt 


ggctggcatc 


4080 


ct ciy y y y ci^ d dyy c* cici 


yyuyu^yyuy 


tpcaccccto 




craccaccGacr 


4140 


atcctgctga cgccggctag 


ggaggagcag 


cccccacaac 


atcgatccaa 


gagggggggc 


4200 


tcagtgggcg gcgtgtgcta 


cctgtcgatg 


ggcatggtcg 


tgctgctcat 


gggcctcgtg 


4260 


ttcgcctctg tctacatcta 


cagatacttc 


ttccttgcgc 


agctggcccg 


agatgcgg 


4318 


<210> 103 
<211> 2288
<212> DNA
<213> Homo sapien












<220> 

<221> misc_feature
<222> (948)..(949)
<223> a, c, g or t 












<400> 103 

tgtgacggaa acattcacac 


ggaacagcta 


ggccatgata 


cgccagcttc 


cccgtctatg 


60 


ggataccaaa gttgccagca 


attaatttac 


attaccaaaa 


aggagaaacc 


attatttatg 


120 


ctatcaccaa agccataccc 


ttaaaagtaa 


aaacacatct 


ccacttgata 


ttagtaattt 


180 


gagaaaactt agcatatcta 


caactactgc 


ttaatccaca 


acttttcatt 


gattcaacat 


240 


tatcacttaa atgcagatgc 


aaaataaccc 


ttgcatggca 


aacatgttta 


ctctgtgcta 


300 


atgaaactat gaaacagttc 


ctgcaagttc 


acagacacag 


tcaatcaatg 


attgactaca 


360 


gaacagactg gtgagtgtta 


ccaaaaacaa 


aagacctgcc 


caggggacca 


ctaggggtgg 


420 


aggaagaaca cctgggtctc 


actcactggg 


cacttgatgc 


tttcattttt 


atctgtaagc 


480 


caggggacaa ttatattaat 


attgtatgaa 


tacactgcta 


ctgggaaaag 


cattcattca 


540 


acagatgccc agctacatac 


tctgtaccag 


gtagtgtgtt 


aagtgctgaa 


acagggaagg 


600 


acaaggcaca tcctacgggg 


gctcgcagtg 


tgggcaggtg 


ctgagctaac 


gcacacacac 


660 
aagggctggg tacctgccag atctgtatca ctccggagcc aacgacctgt ctctctgaat 720

gatctctggc atggtaaaag agttaactgc tatttcaaac aggctgaaca caaaactttt 780 

tttgccttcg cttactgcac tgcttctcaa acttgactca cgctggcttc acggcctgcg 840 

acgtactgca ctcacgaact tactcctgct gaacgccttg cgccgctaac tccactgcgc 900 

tcactacgcc tgcttctccc tgctgctgca cacactctgc tctactcnnc cgctcacaat 960 

ctctccacac aacataccgg agccagcccc ccccccctcc ccccgccccc ccccctgccg 1020 

cccccccccc cccccccccc cccccccaac actataactc ctgcgctcag cgcctcacct 1080 

gcccccgcac tccccccagc tccggcccac acaccatgcc ccgcagccca accctcgcca 1140 

ctcccacact gcacctccgc gcctccaaac agcccccgcg accaacacac accgcgccca 1200 

cacgccgact acccccggcc gcccagccca cccaccactc ccgacatgcc ccctccgccg 1260 

ctccaccata gaacacctcc gccccggccc cgcgcccgac ggccccgaca ccgcgccccc 1320 

gccgccgcca acgcccgcac gccaccaccg ccacccaccc ccacacacgc acccagccac 1380 

cactacccgc caccatcccc accccaccac tcccacccgg cccacaacca ccgcccggcc 1440 

acccaccaac acacgcccgc acgaccccca ccccgccccg ccccaccacc tcgccccgcc 1500 

aacaaccgcc acccacaacg cgcccgcccc accgccccag ccacccatcc acacccaact 1560 

accagccccc gccccccccc ccccgccacc gcaccccact ccaccaagac gaaaactccc 1620 

gacaccgccc actaaacgca ttacagcaag ccgctgagac ccgaaaacaa ccccccgcaa 1680 

gcagaggcac cgctaccatc aaacgcaaca accacacaac gcacggctat catccccccc 1740

ccaccgtgac cacaaacgca ccaccccgcc cgaccagcac gccacacccc cacggccatc 1800 

acgcacagca ccacacccca cgcactcaca cgacgaaata cccccacgcc agccgcccaa 1860 

ctcccacccc ctacggcaca aacccacacg gcggacccac acctaccaca cacacgcaac 1920 

acaggacgca tcagaactac ctaaccacac gcacgaacac acaacagcac accacccccc 1980 

aaccacggac gcacgagctg tcgacccact accaccacac acgatgaccc gacacagcgt 2040 

gcgagcgcca tacccgcgga gcagcacagc caatgcacgg ccccacgacg gcgacacgcc 2100 

gccatccttc agcctcgcgc cacccccgtc caccgaacca agaccaagac gcagaatcgc 2160 

aggccgcacc acacccgcta cggtccgtag cgagacagca aacaacaaga cgactgcggc 2220 

gccgccacca ccgccaagca actacgcaca accaccgacg cctcatcacc ccaccaccac 2280 

aacactta 2288 



<210> 104 
<211> 592 
<212> DNA

<213> Homo sapien 



<400> 104

ggatatcgac 


taactatagg 


ccatgttgac 


taatgctgct 


cgagccggcg 


ctgtgtgatg 


60 


gattggtcgc 


ggccgaggta 


cacatattca 


tgccactcgg 


ctctgaaaag 


aggttcaagg 


120 


tgggtcaagg 


tgggtcttgg 


ccagtggagg 


aaggaaggtg 


tccaggactt 


taggttaatc 


180 


aacagtgggg 


acagagagga 


atgatttccc 


ttgtgaaaac 


aacagggttc 


ctttctcata 


240 


ttcttgtggc 


cagaaactgg 


ggtgaacttc 


agttgtggtg 


taattgaaat 


gaaacagtga 


300 


gagccatttc 


tccatggaac 


tcctatgacc 


tccattttaa 


cttctgacaa 


agttaacttc 


360 


atttatacaa 


tctgtatttg 


aaaacagtaa 


tcacaaccaa 


aaaggtccta 


taaacctgta 


420 


atagatgtca 


aagggattca 


cattctgaac 


tttaatttta 


aggacccttt 


aaaaggccta 


480 


gacttggatt 


aaagtaaacg 


taatattcca 


aactaaaaca 


aggcaccata 


aaaaatcaac 


540 


tcaaaacatc 


caaacaatgg 




feaafccrfcaaafc 


tattttactt 


tt 


592 


<210> 105 

<211> 2180 

<212> DNA 

<213> Homo sapien 












<400> 105 
tttttttttt 


acatctgtac 


aagatttatt 


tcatttacca 


caagttttag 


agtgaaagta 


60 


tatgccctaa 


tacagaagga 


gtgattcaga 


acccaccaaa 


agatcaagag 


ccaaagtcag 


120 


ggcttgtatc 


ccaggtcaat 


aggagtttca 


tccgccattt 


gtttttctgg 


gattaaagag 


180 


atgatgtgca 


gaacgcccgg 


gctgagctca 


ctatggcttc 


tgacatacaa 


gcaccactag 


240 


gaacaagact 


tcatgcttca 


tccttgctct 


ggctgtgcta 


gtgagtggca 


aaagcctctg 


300 


ctcgtgcttg 


ccctccaaat 


ttttcacaag 


cattcctgtt 


actagccata 


gcttcccatg 


360 


acttcacatt 


tgagtaagtt 


atcctgaagt 


gtgtggctgt 


aatattgaca 


aatagaacta 


420 


atgagagtga 


actgccatct 


gtttaacaaa 


acgcactctt 


ctaagagaat 


acccagtctt 


480 


agtgcattta 


gtacatagtc 


acaacattga 


aatgacgtga 


gaatcaatac 


aaaatattta 


540 


ttttttttca 


aaccacagaa 


ttcttaaccc 


cagagccaca 


caataaagtt 


ctcagaattg 


600 


taagccatta 


acatttttct 


aaacaatgca 


gtccagagat 


gaagataatt 


tccaaccagc 


660 


agggatgcaa 


tatatagtag 


gttcccctat 


gaatgaagct 


caaattagca 


tttcctttaa 


720 


ttctcccaca 


gccactccat 


caacagaagc 


agaaacagta 


cacatattca 


tgccactcgg 


780 


ctctgaaaag 


aggttcaagg 


tgggtcaagg 


tgggtcttgg 


ccagtggagg 


aaggaaggtg 


840 
tccaggactt


tagttaatca 


acagtgggga 



cagagaggaa 


tgatttccct 


tggaaaacaa 


900 


cagggttcct 


ttctcatatt 


cttgtggcca 


gaaactgggg 


tgaacttcag 


tggggtaatg 


960 


aaagaaacag 


gagagccatt 


tctccaggaa 


ctcctatgac 


ctccatttta 


acttctgaca 


1020 


aagttaactt 


catttataca 


atcgtattga 


aaacagtaat 


cacaaccaaa 


aaggtcctat 


1080 


aaacctgtaa 


tagatgtcaa 


agggattcac 


attctgaact 


ttaattttaa 


ggacccttta 


1140 


aaaggcctag 


acttggatta 


aagtaaacgt 


aatattccaa 


gctaaaagag 


gcaccataaa 


1200 


aaatcaactc 


aaaacatcca 


aacaatggct 


agatgactaa 


tgtaggttgt 


tttgcttttt 


1260 


agttgcaaag 


cttttcagta 


tctcagatta 


gtgtatgttc 


ataaaacaat 


gctcagttat 


1320 


tttaatagct 


gcttatgaaa 


caataacagt 


ttaactcaag 


ggcaatgctt 


cttgcataat 


1380 


aatcacaaaa 


ataattaact 


gctataaaac 


gggaaaaaaa 


gtagaagaaa 


ataaagccag 


1440 


cctcaataat 


aaaaaggcaa 


aatctggagg 


ggttactcgg 


cttaaaaaga 


gattaaccag 


1500 


gattatttaa 


atactataat 


acacagtgct 


cctgctcact 


tctaactgca 


gaaaccaatt 


1560 


ttgtttgcta 


gatcaccatt 


acctttgcta 


gtatgcgtac 


agaccaccac 


tcggaagtta 


1620 


tccttttgtg 


ctgaaaaacg 


ttcaaatccc 


ttgtttggtc 


agtacagaat 


attgcgaggt 


1680 


gatgctcatg 


caaactcttc 


ctgaggaatt 


tatgtgtgca 


aatctgcaac 


ccgacagcat 


1740 


ggcacgcagc 


cagggagtgg 


tagctgcaca 


gtgtgagcac 


tggagatgga 


tgtgcagtgt 


1800 


gcagtgttca 


cagccatgga 


catccattct 


tctgcaatct 


catctaccca 


caaattagct 


1860 


ttcactctag 


accccaaagg 


gagggtaatt 


gctgcaaatt 


tgttaaaggg 


acagaagaaa 


1920 


aagtagcttg 


tctacaaaat 


aatgcacaat 


gcatgcatct 


gggtttgtgt 


ttcttctcac 


1980 


taaccttgcc 


taagaaccat 


taggataaaa 


gtcacaacac 


caggttttgc 


tttctgcccc 


2040 


acaaaaaaac 


agtagttaat 


tcctgtcagg 


ttagggtaca 


ggtgtgacaa 


caaaaggtca 


2100 


caaaatgaca 


atgttactga 


agcttaaggc 


caacctttaa 


aacatgtacc 


gtctctcaaa 


2160 


acaattatcg 


atttactttt 










2180 


<210> 106 
<211> 611 
<212> DNA 
<213> Homo sapien 












<400> 106 
cgggcgtctg 


gcgcaggtac 


gttacggagc 


tatctgggag 


gaggaagccg 


tccacccaag 


60 


ctcacccgac 


tggttcaagg 


ttaacaatgg 


agagcctcac 


aaacacacgg 


aggcacttgc 


120


gagaggcaga 


ggctggggcc 


acagagctgt 


gaggagagag 


tgaatctccg 


tgcatggcag 


180 








acatgttgtg 


agaggcagag 


gctggggccg 


cagagctgtg 


aggagacagt 


gaatctccgt 


240 


gcacggcaga 


cacacaaacc 


ccatcctcag 


ctgcagattc 


acaggagagc 


atgggggcaa 


300 


gtgggctctg cagttataca 


gcaggacagc 


gcacggcctt 


gggcaggcca 


caatttctta 


360 


aagggaatgg gaaaatgaga 


gcttacggaa 


agcaggataa 


agcgccctcc 


actcatgctg 


420 


aaaagcacga 


agacctgctt 




appl* nr c t* fifl 

CL \* LVHw\< LMu 




otcaoctacc 


480 


aatgcaagac 


acaagagcaa 


gaaggcccaa 


gtgcagagag 


cgcaagttaa 


ctcctccagc 


540 


tccaagacac 


agggtgagag 


caggctacgt 


gtgggcagag 


acggcatcta 


tctgcccatc 


600 


tctgggaaaa g 










611 


<210> 107 
<211> 1845 
<212> DNA 
<213> Homo sapien 












<400> 107 
cacgaccatc 


tcgcactacc 


ctgactacag 


ctggagcgcc 


cagaggcgag 


ccctggtgga 


60 


ggaagaggtg 


agctgagtgg 


ggcctatgct 


gggggaagct 


ggcccttccc 


acattccacg 


120 


ccctgagtgc 


aaacrfc crc tat 


tttccgccgg 


ggtgctttga 


gtctcgctct 


gtcaccaggc 


180 


tggagtgcag 


tggtgcaatc 


ccggctcact 


gcaacctccg 


cctcctgggt 


tcaagcgatt 


240 


cttctgcctc 


tgcctcccga 


gtagctgggg 


ttacaggcac 


acaccaccac 


gcccagctaa 


300 


tttttgtgtt 


tttggtaggg 


acggggtttc 


accgtgttgg 


ccaggatggt 


ctctatctct 


360 


tgacgtcatt 


tgtgatccac 


ccgcgtcagc 


ctcccaaagt 


gctgggatta 


caggcgtgag 


420 


caatgtaata 


aacagacttt 


attttttaga 


gcagttgaag 


gtaacaggaa 


gtggagctga 


480 


aaacacaggg 


tggccccatc 


catgccctct 


cacagcccct 


ccttcctctg 


agcccctcac 


540 


gtccttggct 


ctcctccgaa 


gcttcttttc 


ccagagatgg 


gcagatagat 


gccgtctctg 


600 


cccacacgta 


gcctgctctc 


accctgtgtc 


ttggagctgg 


aggagttaac 


ttgcgctctc 


660 


tgcacttggg 


ccttcttgct 


cttgtgtctt 


gcattggtag 


ctgactgtcc 


cggcctcagg 


720 


caggtcattc 


ctggaaagca 


ggtcttcgtg 


cttttcagca 


tgagtggagg 


gcgctttatc 


780 


ctgctttccg 


taagctctca 


ttttcccatt 


ccctttaaga 


aattgtggcc 


tgcccaaggc 


840 


cgtgcgctgt 


cctgctgtat 


aactgcagag 


cccacttgcc 


ccctgctctc 


ctgtgaatct 


900 


gcagctgagg 


atggggtttg 


tgtgtctgcc 


gtgcacggag 


attcactgtc 


tcctcacagc 


960 


tctgcggccc 


cagcctctgc 


ctctcacaac 


atgtctgcca 


tgcacggaga 


ttcactctct 


1020 


cctcacagct 


ctgtggcccc 


agcctctgcc 


tctcgcaagt 


gcctccgtgt 


gtttgtgagg 


1080 
ctctccattg


ttaaccttga 


accagtcggg 



tgagcttggg 


tggacggctt cctcctccca 


1140


gataccctca 


cgtaccctgg 


ggttctgggt 


gtcacagaag 


ctgctctggg tgcggccact 


1200 


ctggagcggg 


gcccttgtgc 


caggtgagac 


acccttctcc 


tcagcaagcg gacttggtgc 


1260 


gcagcaggcc 


aatacgtcgg 


tgccgacatg 


ccatgaagca 


cgggaggccg taggagattg 


1320 


agggaggtga 


atgactcagt 


cctcatcgta 


aggagcgcgc 


tgaggccgaa gatgagcagc 


1380 


ggagctggtg 


catggggccg 


actggcccca 


tccatgccct 


ctcacagccc ctccttcctc 


1440 


tgagcccctc 


acgtccttgg 


ctctcctccg 


aagcttcttt 


tcccagagat ggggcagata 


1500 


gatgccgtct 


ctgcccacac 


gtagcctgct 


ctcaccctgt 


gtcttggagc tggaggagtt 


1560 


aacttgcgct 


ctctgcactt 


gggccttctt 


gctcttgtgt 


cttgcattgg tagctgactg 


1620 


tcccggcctc 


aggcaggtca 


ttcctggaaa 


gcaggtcttc 


gtgcttttca gcatgagtgg 


1680 


agggcgcttt 


atcctgcttt 


ccgtaagctc 


tcattttccc 


attcccttta agaaattgtg 


1740 


gcctgcccaa 


ggccgtgcgc 


tgtcctgctg 


tgtaactgca 


gagcccactt gccccctgct 


1800 


ctcctgtgaa 


tctgcagctg 


aggatggggt 


ttgtgtgtct 


accgt 


1845 


<210> 108 
<211> 160 
<212> DNA 
<213> Homo sapien 










<400> 108 
ctgttacttc 


cttttgctgt 


cagagggttg 


ttgacaatgg 


caagggggga tgttagtgag 


60 


atccaggtgg 


tggtggcttc 


ttggtccact 


cagcttgcac 


acatgcaaga ggagggtctg 


120 


tggcccctca 


gccgtgctgg 


tggcctcctt 


cctcaagctt 




160 


<210> 109 

<211> 4621 

<212> DNA 

<213> Homo sapien 










<400> 109 
ctgagagaag 


cacgggggga 


ttgattggtg 


cttttagaga 


acacacctag atacaaatgt 


60 


gatacatttc 


tatactcaaa 


ttagaacaca 


tgataatgta 


gatttgttca ttcatcaaag 


120 


acaatcaccc 


attatagttg 


taattctcag 


tatttggctt 


ctcttgtgga tgatcagtat 


180 


tacttttctt 


tcataggtat 


tggatatgct 


ttttaggtct 


atcaaaaact aaagtctctg 


240 


aagtgacact 


ggtacaagca 


attcacttga 


atttctaaaa 


acagataaaa tataatggac 


300 


cggccgcggc 


cgccgggagt 


tttttatttg 


gaatattatt 


agatgatgcc ctatgataag 


360 


atgagacaaa 


tgatggggag 


ggaaggagga 


tggcccttct 


ctagaatcag caaacccaat 


420 
ggttctgtga


aggtcagacc 


caagcttgga 


gcagctggtc 


tggatgcaga 


tccaggagct 


480 


gcagagtgtg 


gaacaggaaa 


ggctctgccc 


agggcccacg 


agctctagca 


ttcctgctgg 


540 


cagattagag 


atcttagtta 


aatttgacca 


agaaaggagc 


ttagctaaga 


ggtctttgtt 


600 


tccagaggac 


ccaagactac 


cagactcctg 


tgcagtctgc 


ctgtctgcag 


agcctccaat 


660 


cacggtttaa 


agtggtcttg 


gcctcccagc 


ccccactgct 


gcagccttca 


gagccagcct 


720 


ctggactcag 


cgccaggagc 


ctcctctcct 


gctgccctgg 


tacctgtcat 


cctagtgtca 


780 


cctatttgtt 


ttgcacagca 


ttctggagaa 


catgtgcctc 


acaaagtgat 


cctgtttttc 


840 


tcaccctcca 


aagctgaggt 


ggctaaggga 


ggcaggggtg 


agagagctca 


ctctctaaac 


900 


cacaggcagg 


gcaaggtgct 


tctaggccgt 


gtcccctccc 


tggggaatgg 


tcactactcc 


960 


tcagcatggc 


cacaggcccc 


tgtggttttg 


ggagtttgct 


gagtgactgg 


aatgggctga 


1020 


ccccagccgt 


ttccctgctg 


tggaagcagc 


aaggtagggg 


cactgctcgg 


tggtttgtca 


1080 


agctcagagg 


cagccacgga 


tggttaggaa 


caggaagcca 


cactgctaag 


gtgtgatgag 


1140 


ctagaaggga 


gacatcctgg 


acttcaagtg 


agacaagtgc 


agaggaggcc 


actccactta 


1200 


ggaccttcct 


cttagacctg 


catgagcacg 


tgtttcactt 


aagcacattg 


ggtccgaatt 


1260 


gtggcgcatg 


aacccagctg 


ttaaaccagg 


taccctggga 


ctgccctctg 


cctctcagag 


1320 


ccctacctgt 


cctggacttg 


aaggcgagca 


tcaggcaggc 


cttctcagcc 


catcttccac 


1380 


caggagtgca 


gagccagggt 


gctgagagga 


gcagggactg 


taactacagt 


aggaggcaac 


1440 


tttgctggtg 


tcattcctgc 


tcggattgac 


tttgtgctcc 


ctcccatcca 


cagctatggt 


1500 


agccagtatt 


tccccagcag 


ctctctagtg 


gggaagagtg 


gctagaacca 


ggccattctc 


1560 


agtcgaattt 


aggaagacca 


agactcatgt 


tcctgacagg 


acggtcattc 


cctcatggca 


1620 


tcactgactc 


agttctaatt 


cctcccctcc 


tgtttcattt 


tggtttgaac 


aacctgtgcc 


1680 


tgcagctgtt 


tcttccacct 


tcctggagct 


cacaattgga 


gcttcatgtt 


cagtcttgag 


1740 


ctcctaatgt 


ttccagacag 


ggcaggggat 


aggacagcca 


taatctatga 


aagctgtgaa 


1800 


acaaaccctg 


ggttatatag 


acctcagatg 


gtctcatctg 


aaggctttgc 


ccacttccca 


1860 


gtgagcagag 


ggtttgggga 


acaaagaggt 


agctgaagag 


tggtggtttt 


tccctgtatc 


1920 


ctgattttca 


gaaagggctt 


tagtctctga 


gctgtggcta 


gtagctggct 


ggctgaccag 


1980 


gggacggggg 


tctaataggg 


tgtctgacca 


taggacacct 


ctcgtgtttg 


gccttccctt 


2040 


tcttgtgtgt 


ttgtgtgtgt 


gtgtgcatgt 


acacacacaa 


atctgtatcc 


ccaggagtgc 


2100 


ctcctgcagt 


cactgggggc 


tcttccgtgg 


tgcaccatcc 


catgagagca 


gatcagaggg 


2160 
ctgtggtcag


gacttcctcc 


cgagctgccc 


tccctccctg 


ggatgagttt 


cctctccacc 


2220 


tgctccaagg 


tgccttacga 


ggtgcagctc 


ttcacctggc 


cacatggcat 


ccatcaccgt 


2280 


gttgagaaat 


ggtgggacgt 


tggctggata 


ttgtgtgcta 


tgaatctgat 


ttgtccaaat 


2340 


ctggacctca 


gccaagccac 


ctgtggtgga 


accttgcaag 


tcaaggtcat 


gaacaaattt 


2400 


cttgtaacta 


ttggaaataa 


actaatgagg 


tagaactgtc 


atatccaaat 


ggaagaggtg 


2460 


gttggtactg


ttacttcctt 


ttgctgtcag 


agggttgttg 


acaatggcaa 


ggggggatgt 


2520 


tagtgagatc 


caggtggtgg 


tggcttcttg 


gtccactcag 


cttgcacaca 


tgcaagagga 


2580 


gggtctgtgg 


cccctcagcc 


gtgctggtgg 


cctccttcct 


caagctttaa 


cttgtgccct 


2640 


ccctggcaac 


agggagtcat 


gggttaatgt 


cacctgtcat 


gcctctttct 


tcagggaaaa 


2700 


tattaaaagg 


ctcaaatcca 


agctgtgact 


tcccttagac 


agtgtaagcc 


tgtagggtgc 


2760 


agccccaagt 


ttatgtccaa 


aggctgcctc 


agaaccctgg 


agggagcagc 


gcccagctgg 


2820 


cagctcactc 


tccccttgca 


gttcactggc 


cttttcgcag 


ccaggctaag 


ataggcagtg 


2880 


caaggagctt 


cccactggcc 


agagcagctg 


ccttgtagaa 


tgtggggctg 


gcttttagaa 


2940 


gtttcacacg 


cttgcagcta 


agagtgcaga 


gctctctgtt 


ttacttcgtg 


aaacattagc 


3000 


agctctgagg 


ggcttggctt 


ttggtaaggg 


acttatagtc 


tgacttgcca 


caagcacctt 


3060 


gagctcctgg 


gttggttggt 


ggttggttgg 


ttgggttgtt 


ttaaatttga 


tttttctttt 


3120 


cctgaagttt 


gttagaagtc 


agacctatgc 


aaggaaaacc 


atggcccact 


tcagcagcca 


3180 


tccagtgggg 


gtgcttaatc 


ggtggtggta 


ggattcatgg 


aaacgttttt 


ttgaaatgaa 


3240 


agagaagatg 


atgtacttac 


attgtaaaat 


ggtttacaat 


aaagtttggc 


atcttattac 


3300 


agtttttttt 


aaaaagaaac 


catcacattt 


gtggttgttg 


gttgtgggtt 


ctctggagtt 


3360 


gaaacagctt 


ggcagggtca 


aaaaagattc 


ccctgtgaat 


tagaaggttc 


tccccaggga 


3420 


ccagagtgag 


aaggaacctt 


acgaagcctc 


atggcaggag 


tgggagccag 


caactgccca 


3480 


gtgaaaagcc 


aaacatgctc 


ttgccatgtc 


catctccact 


tcttcgctag 


tgagaattaa 


3540 


agcaagtccg 


tctctgagga 


ggacatcgtg 


ttggggaggt 


agagtataag 


cagtgagtag 


3600 


cccacatcct 


tgtccctgag 


gctgcggggc


tggaaggagg 


tccctgggca 


gagccgcagc 


3660 


atcaggtgcc 


atctgcaggg 


gcacccctgg 


aactagtgga 


aggcccccac 


aaggcctggc 


3720 


ctgccctcag 


ctgcctcttg 


gcccatgtgc 


cacatcctgg 


gcattcctca 


tgtcagcact 


3780 


tgtctcagat 


caagcctggc 


agtcctctca 


ttggcctcct 


ggagccaggg 


tgtcaggagg 


3840 


gggattgggg 


tggggttcaa 


agtatgacag 


caatggagtg 


gcatttggga 


taaccaaatg 


3900 


tcagtgcctg 


gttgagttcc 


atctgcctgg 


agccttcgag 


gataaaaata 


atctccatca 


3960 
taaactgaaa aggacttaga tatgtgtgtt actgggaggg ctgcgaccta tgcacaccta 4020

ttccccatac ccaccccagg cgtaggctcc tggcctgtgc cctgggggat cttccccttg 4080 

caaggagggg tgtgttagga gcttctagca aaggagagct aagtgacctt ccggggagcc 4140 

cttgggcctg ttcttgtctc tgcagctcag tatctggagt gggcttcact ctcccccacc 4200 

ctctttgagc aactgaaact gtcctgcagg ccagcccccc tgcggtggcc tcacctactc 4260 

ggaggtgagt aggctcaggc ctgagcctga gcctgtttcc cacctctgtc tcatcagtgc 4320 

tgtcaccatc gagcatggtt tcctccctgg tcctttccag accccttact cccccctctc 4380 

tgaacttgca gttggaatag gatagaggac aaggacatca gcaaaaacat gagcaactcc 4440 

ccaggctccc tttacctcac tttctccacc tgtgaacctg tggtagcccc tctctgagtt 4500 

tgctgcaagg gcccagtgag cttgccataa tcagacgcag cagacacggg aagctcctgc 4560 

tctgccacca tgcagggatg gaaaatctgt ccactggaat ccaggggaag ggatggaaga 4620 

g 4621 



<210> 110

<211> 303

<212> DNA

<213> Homo sapien

<220>

<221> misc_feature

<222> (176)..(255)

<223> a, c, g or t

<400> 110



tccctgttat tgttccgtat attacctgta agcagatact gtattttatt ttagcctatt 60 

tgacagaaca catcactcag aaaaagtgaa gtttcagagc aaacagtgaa gaaatcagtg 120 

tgattgtaga caaaaagtcg gttcacagaa cggagcagcg gggggaggaa gggaannnnn 180 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 240

nnnnnnnnnn nnnnncccgg gggcagccgg ggctatgagg ggacgtgctg atgcgtgatg 300 

atg 303



<210> 111 

<211> 1848 

<212> DNA 

<213> Homo sapien 

<400> 111 

tgtttttttg agacacgggt ctctggttaa ccccggctgg ggtgccatgg tgccatctcg 60 
gctcactggc


aacctttgcc 


tccccgattt 


aaggcggttt 


tttcccgcct 


ctagccctcc 


120 


ttgagtatgc 


ttgggactac 


agggcgccca 


cctctcacgc 


ccggcttatc 


tctctgtaat 


180 


tcttagtaga 


gacaggcttt 


tcacccacgg 


tggcccggct 


ggtctccaac 


tcctggcctc 


240 


cagtggtccg 


cccccctttg 


ccttcccaag 


tgctgggatt 


accggggtgg 


ggcccccgtg 


300 


cctggcccgt 


cctatcctcc 


cccccaccct 


tttttttttt 


tttttgacat 


ggagtctcac 


360 


gccatcaccc 


aggctggagt 


gcagtggcgc 


catcttggct 


cactgcaacc 


tccgcctccc 


420 


atgttcaagt 


gatcctcctg 


cctcagcctc 


ctgagtagct 


gggactacag 


gcgcccgcca 


480 


ccacgcccag 


ctattttttt 


gtatttttag 


tagagacggg 


gtttcgccat 


gttagccagg 


540 


atggtctcga 


tctcctgacc 


ttgtgatcca 


cccgccttgg 


tctcccaaag 


tgctgggatt 


600 


acaggcctga 


gccactgtgc 


ccagccttat 


cctcaatttt 


ctaagcataa 


ggatttaaga 


660 


gttagtggga 


aagcatgact 


acggtttaaa 


agaacctggt 


cagctgaagc 


agtcccactt 


720 


ggccattcca 


gtacacttcc 


tactctcgct 


tcatggagaa 


gaccgtctac 


tgacaagaga 


780 


atccatcccc 


tttgcacaca 


gtggtcacct 


gacagctcag 


aggtcaactt 


cttggttttg 


840 


aggaaaaatt 


caaaaggctg 


cacgtttaaa 


ggagttgttc 


catttcagtg 


ccccattagc 


900 


ctgtctttcc 


tcagaaattt 


accaatctct 


tgatgtgata 


agcaccaaac 


tatgaagctt 


960 


ttcccttcct 


ctccccgctg 


ctccgttctg 


tgaaccgact 


ttttgtctac 


aatcacactg 


1020 


atttcttcac 


tgtttgctct 


gaaacttcac 


tttttctgag 


tgatgtgttc 


tgtcaaatag 


1080 


gctaaaataa 


aatacagtat 


ctgcttacag 


gtaatatacg 


gaacaataac 


aagggactga 


1140 


aggtacaaaa 


cacacacgga 


cattatggtc 


atataaaata 


tatagaagta 


cctaacgtta 


1200 


acaatttttg 


taagaagttg 


cacattttgc 


tattttttta 


aagcatttcc 


ccacattttt 


1260 


tttcatctgt 


agatcctctg 


cagttaagat 


tgagcacccc 


cccgcaactt 


ttttttttga 


1320 


tacagagcct 


cgctctgtca 


ctcaaggctg 


gagtgccagt 


ggcatgatct 


tagctcactg 


1380 


caacctccgc 


ctcgcaggtt 


caagccattc 


tcctgcctca 


gcctcccgag 


tagctgggac 


1440 


tacaggcgtg 


cgccactgtg 


cccagctaat 


ttttttttgt 


atttttagta 


gagacaaggt 


1500 


ttcaccatgt 


tggccaggct 


ggtctcgaac 


tactgacctc 


gtgatccgcc 


tgcctcggcc 


1560 


tcccaaagtg 


ctgcgattac 


aggcgtgagc 


cactgcgccc 


ggccaagatt 


gagctccctc 



1620 


ttgcaatgtt 


tctcaaattc 


atcacgattg 


gaacataccg 


acggaattca 


ttttctgtct 


1680 


gaagcaatga 


gtgctataca 


tgaaagtttt 


cctcacatat 


aatattcatc 


aagtttattt 


1740 


acagtataag 


ctttccgatg 


ctaagggctg 


agatgtgatt 


aaaggctttg 


ccgtattcat 


1800 


cacattcagg 


aggctggggg 


ggggggatta 


atttaattaa 


ggatcctt 




1848 
<210> 112

<211> 333 

<212> DNA 

<213> Homo sapien 

<400> 112 

ggccgaggtc tagaatgggt gaaggaatgt gttagtgtct ggagccatga aggccttgtg 60 

ctgaccaagc tgctcacctc ggaggagctg gctctgtgtg gctccaggct gctggtcttg 120 

ggctccttcc tgcttctctt ctgtggcctt ctctgctgtg tcactgctat gtgcttccac 180 

cctgcgccgg gagtcccact ggtctagaac ccggctctga gggcactggc ctagttcccg 240 

acttgtttct caggtgtgaa tcaacttctt gggccttggc tctgagttgg aaaaggtttt 300

agaaaaagtg aagagctgga atgtggggga aaa 333 

<210> 113 

<211> 2269 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (2266)..(2266)

<223> a, c, g or t 

<400> 113 

gcgagcggct ggcggatccg acgcgcgaga ccgggagggg acgagggcgt tgcaatcgtt 60 

cggggcgggg gctttccggg gagggggtgc tcaggtgcac cagcggcggc ggaccctcag 120 

actctgccct cccctccctt taaccccctt ccagccggac gggaggcggg gcagggctga 180 

gcatttgtga cacctacatt tccgtggctc ccttcttttc ccccgacccc tgtttatctc 240 

ttcgccttcc agaagttctt ttccatcagg ccgtcgcacc ttgcgtggga aggagcaccc 300 

cacttggaag caggaggcgg ggttcagatc ttggccctac ccctcctgtg ttaaagtccg 360 

cgagcctcag tttccctcac agtatttttt gcctcgcctt acccggtttt gaggatctgt 420 

acgagaaaga gaaaggaagt ggacatttgt tgaattcctg catggccaaa taccacgcag 480 

actgcttcat ccgccacgtt taatccttat tacttggtgt tctcagaact cccatttcat 540 

ggattcttaa gctcacagag tcagtgaata acagaaaggg attcagatct agccgtttag 600 

ctgcacagtg gagttcttct ccagagtctt cccttgtctg ggctctggct ggaactattc 660 

ctcagccaaa tcctcgcccc agaacagtgc ttcctgtttc tccagctgag aagtctccct 720 

ttcagtttcc ttcttccagc acggagtaca ctgctctgcc tccacttaga ttacttcaga 780 
aatgaaatgc


agcaaatatt 


tatccagcag 


tgcagggagt 


tgaacttttg gagtcgggaa 


840 


ccttggattc 


ttgttctggc 


tctgccactt 


actgtgtggc 


cttgggaagt 


cctttgtctt 


900 


ctctgagctt 


tcttttctct 


ttgcgtaaaa 


gcggtgctct 


tgtcccattc 


tccctccctg 


960 


tcttccagca 


ggctctcccc 


ggaggctcag 


ccccctctgc 


tccccatggg caactgccag 


1020 


gcagggcaca 


acctgcacct 


gtgtctggcc 


caccacccac 


ctctggtctg tgccactttg 


1080 


atcctgctgc 


tccttggcct 


ctctggcctg 


ggccttggca 


gcttcctcct 


cacccacagg 


1140 


actggcctgc 


gcagccctga 


catcccccag 


gactgggtct 


cttttttgag atcttttggc 


1200 


cagctgaccc 


tgtgtcccag 


gaatgggaca 


gtcacaggga 


agtggcgagg gtctcacgtc 


1260 


gtgggcttgc 


tgaccacctt 


gaacttcgga 


gacggtccag 


acaggaacaa 


gacccggaca 


1320 


ttccaggcca 


cagtcctggg 


aagtcagatg 


ggattgaaag 


gatcttctgc 


aggacaactg 


1380 


gtccttatca 


cagccagggt 


gaccacagaa 


aggactgcag 


gaacctgcct 


atattttagt 


1440 


gctgttccag 


gaatcctacc 


ctccagccag 


ccacccatat 


cctgctcaga ggagggggct 


1500 


ggaaatgcca 


ccctgagccc 


tagaatgggt 


gaggaatgtg 


ttagtgtctg gagccatgaa 


1560 


ggccttgtgc 


tgaccaagct 


gctcacctcg 


gaggagctgg 


ctctgtgtgg 


ctccaggctg 


1620 


ctggtcttgg 


gctccttcct 


gcttctcttc 


tgtggccttc 


tctgctgtgt 


cactgctatg 


1680 


tgcttccacc 


cgcgccggga 


gtcccactgg 


tctagaaccc 


ggctctgagg gcactggcct 


1740 


agttcccgac 


ttgtttctca 


ggtgtgaatc 


aacttcttgg 


gccttggctc 


tgagttggaa 


1800 


aaggttttag 


aaaaagtgaa 


gagctggaat 


gtgggggaaa 


ataaaaagct 


tttttgccca 


1860 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1920 


aaaaaaaaaa 


caagcaaaaa 


gacaagagaa 


gaaaacacaa 


gaagaaaaaa 


aaaccacaaa 


1980 


caaagacaaa 


agaaaagaaa 


agaagagacg 


aaaagagaga 


aacaagaaag 


aggagaggac 


2040


aaagagcaag 


aacgaagaga 


gaagaaaaga 


gagcacacaa 


agagaagaga 


agaagcgaga 


2100 


ggaagcgaaa 


aagagagagg 


aaaaagagaa 


aagaagagaa 


gaccaaaccc 


gagaagaaga 


2160 


gagaggaaag 


gcgagcgagc 


aaagagcacg 


agaagagaag 


aaaggaagcg 


agggtgccaa 


2220 


gagaagcgga 


gagaacgaga 


gggagaaaag 


agggagagga 


agaagnagt 




2269 



<210> 114 

<211> 550 

<212> DNA 

<213> Homo sapien 



<400> 114 

agagatatat agcccgtctg ggctaacttg tgtcctctag atcatgcgtc gagcggtcgc 60

cagtgttatg tatcgctggt cgcggccgag gtacacacaa gaagccagaa gatatttttt 120

tttcagtgaa ctttctcctg gaagcaaagg agaagctatg ggagatccag gcatggtttt 180

ggcttctgga ggctgttttt tggttactgg ggtctcttca aagcaaaacg ggatcaggat 240

gaagaggggg aaaggcatgg gccataaata aataaggaaa tttgccccga ttctacaaat 300

gcatctgatg gaatgaggag ggtcaggctg tgagggattg ggggtagctc caggctgctg 360

ttagggacac acttgagctg gttcattccc tggagtgccc cgcaactgcg tctagggaga 420

ggcaggccca agcacatgtt cagtctaatt ccagctcaca ctggaaggct tcctataggt 480

ttggttgccc tgatcacagc cgagctgccc caccaggaat taaccctggt gggcaaaggg 540

tccatcttct 550

<210> 115

<211> 4441

<212> DNA

<213> Homo sapien

<400> 115

aacacracrcrca tcgccgcccg cgcccctcta agtgccgggc cgcaagctcc accgcagccg 60

cctgcaagca gcggcgcctc ggccctcgac ctgcgcgcaa agcctgtgct ggagccgtcc 120

tcccgcggcg gggaccggga ccggggaccc aagccaatcg aaagctccaa ccatggccat 180

ggggctcttc cgcgtgtgtc tggtggtggt gacggccatc atcaaccacc cgctgctgtt 240

cccgcgggag aacgccacag tccccgagaa cgaggaggag atcatccgca agatgcaggc 300

gcaccaggag aagctgcagc tggagcagtt gcgcctggag gaggaggtgg ctcggctggc 360

ggccgaaaag gaggcactgg aqcaqqtqqc ggaggagggc aggcagcaga acgagacacg 420

cgtggcctgg gacctctgga gcaccctctg catgatcctc ttcctgatga tcgaggtgtg 480

gcggcaggac caccaggagg ggccctcacc tgagtgcctg ggcggtgagg aggatgagct 540

gcctgggctg gggggcgccc ccttgcaggg cctcaccctg cccaacaagg ccacgcttgg 600

ccacttttat gagcgctgca tccqqqqqqc cacggccgat gcagcccgta cccgggagtt 660

cctggaaggc ttcgtggatg acttgctgga agccctgagg agcctctgca accgggacac 720

cgacatggag gtggaggact tcattggcgt ggacagcatg tacgagaact ggcaggtgga 780

caggccactg ctgtgccacc ttttcgtgcc cttcacaccc cccgagccct accgcttcca 840

cccagagctc tggtgctccg gccgctcagt gcccctggat cgccagggct acggccagat 900

caaggtggtc cgcgccgatg gggacacatt gagctgcatc tgcggcaaga ccaagctcgg 960

ggaagacatg ctgtgtctcc tgcacggcag gaacagcatg gcgcctccct gcggcgacat 1020
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ggagaacctg 

3 3 3 3 


ctgtgtgcca 


cagattccct 


gtacctggac 


gttccagacg 


gccctcacca 


gagcctggaa 


gggcatcgcc 


ggcctttggc 


cagctggaca 


gcccggggtc 

*J m/ W* W 1 


cctgaagatc 


catgcccttc 


aacctgattc 


ctgtgatcca 


gtgtgatqac 

3 w 3 •»3****3*— w 


ccaccttccc 


aQQQaqccct 


ctgagggcac 


cccagcctcc 


ctttgctgtc 


tatgagcgac 


acttcctcag 


gacgacacta 


ctgccacctc 


agctgcctgc 


agatagcatc 


cttcctgctc 


cggtcccagc 


qqgctcagca 


gctaccacct 


gaagacggcc 


ccqqcaqgcc 


qccqactqqa 


agqcqgggca 

33 3333 


gctggacgct 


cttcctggag 


aacracrcttQC 


tccagaagaa 


gctccaccac 


cfatacctaaq 


qccatqqqac 

3 w w cb. 3 3 w. w 


tccctgaggc 


cqtqctcaqq 

^3 w 3 33 


ccggcccttc 


gtcctgcagc 


gaagccttta 


ccgtaagaca 


gctcaagaat 


gccccagcgc 


tcattagcga 


gtattcccta 


taccccaaaa 


agctgacgtc 


ttttacagaa 


tgtgggatcc 


ccctcacgtt 


cacacccctg 


qtqqcatctq 

3 WJJWW.WW w^ 


ccagccctgt 


tcqtqqqaqc 


cqtqctcaqc 


ctgccaggaa 


gccaagccct 


tttcaacggg 


aagc t gg 1 1 1 


gcttcatacc 


attgggatct 


cfCffcttaqqqa 


ctgatccct t 


gcagtttact 


tctggatcac 


gcagaacacg 


ctgtggaccc 


tgagttagag 


acaatgcaaa 


tttttgaatc 


ccagatccag 


tctgtacttg 


aatatgagca 


acagggaacc 


gtgttaagac 


ccagcacccc 


tattcccagg 


gcagccaaag 


cactaacagg 


gacagatatg 


ggaatgtcca 


caatagtggt 


cccaccatgg 


ctgccacttt 


tttatactat 


aaafc t ccracrcr 

•••*•* w w w 3 "*3 3 


cccgagtgac 


taacgtctct 


gtcacacgga 


gagaaacaca 


attagccact 


t tttcagcta 


cacttctcac 


tcactcaggt 


gcaccccctt 


ctgctgtcct 


ttccccaacg 


gggtatttgc 


cacactgggt 


gccagctcag 


cagcccccca 


agctggtctt 


tctgactatc 


attgtggtag 


ggggaggaca 


acctggaaaa 


agagacacac 


ggggtgactg 


tggcaaagga 


tcacttcttc 


attggcaacc 


acaaggcacc 


tgaggccatg 
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acgatgcagg 


tcatgaagtg 


1080 


cacaagtacg 


agttcgacct 


1140 


aagttccgtt 


cagggaagtt 


1200 


tcggacctgt 


actttgtctc 


1260 


agcacagact 

W* «J 


ggctcctgtc 


1320 


aaggcactgc 


ccgagggcgc 


1380 


tccaagcaga 


gccgcctgac 


1440 


ctactgcacc 


tcctactcct 


1500 


cgtctgcacg 


agttgctgtg 


1560 


ttcttcatcg 


gcaaccgcaa 


1620 


gccgagcccc 


tcaacctctt 


1680 


ctggactcct 


tctatgagat 


1740 


catgtcccct 


cagaccagcc 


1800 


tcgagctaag 


atgagggcat 


1860 


tctggggaca 


aggcgggctt 


1920 


acagtgcaga 


ggaaacagaa 


1980 


gctggtaaag 


ctgttatttg 



2040 


catgaatggc 


caagatggtg 


2100 


tgttggattg 


ggtgtaattc 


2160 


gaggatctac 


aagaatgctg 


2220 


agcttctggc 


ctgaccatct 


2280 


cctttgatcc 


gcatcctgca 


2340 


ttggagaaaa 


gaccttgtat


2400 


aatgggtact 


tggtggcata


2460 


tcagctgcac 


cctacacttc 


2520 


tactgggtcc 


cgagcgtggt 


2580 


cctctcttta 


ttctctccaa 


2640 


gatgctaaag 


gtggaagctg 


2700 


cagctggaaa 


agaaactcta 


2760 


gcactcccag 


aggctgtgcg 


2820
cagagccaag cctctcaacc
gacagtagac tccttcgatg 
tgcagtctgg cccagttcag 
caggttgtcc tcagggtccc 
aacctagctc cagtctgcag 
actgcacaga acccacccct 
aaagagcctg gcagaagctc 
gggatcagga aggctctttt 
accgttactt cagttaccct 
agggtttaag ccggccatgc 
accccacaaa ctgaaggaag 
tcagaaagct tgagaagatg 
cggctgtgat cagggcaacc 
tgaacatgtg cttgggcctg 
accagctcaa gtgtgtccct 
ccctcctcat tccatcagat 
gcccatgcct ttccccctct 
caaaaaacag cctccagaag 
tccaggagaa agttcactga 
cagaactcgg tggggaaacg 
cagtggggct cttggtgttt 
caaggagtaa aaatggtcac 
gggcagagtt ggggacaggg 
tttttattct tacactttca 
gtctttgagg caacttgtgt 
cgtagggtgg attccagcct 
ggagctggaa ctaacaggag 
t
tcttctggcc


ctgcgtcctg 


cagcgaagtc 


tctgctgtaa 


2880 


aggtgctcaa 


aaatgctacc 


cggggtggtg 


gtgtctggct 


2940 


agaaagttgc 


agagatcagg 


ggccaaggat 


gtcatagccc 


3000 


aatcctaggg 


cagggtgtgc 


atggaagcaa 


gaactatgga 


3060 


gctctgagcc 


cctagttcct 


cactccagcg 


gggctccctc 


3120 


tctgtgtggg 


cactgctgac 


cacacagatg 


acccagaccc 


3180 


tgtggttgga 


gctgggctcc 


gtctccaggt 


ctggttcagg 


3240 


ccacctgtgg 


cttcactggc 


cctttgagat 


ttcctatctc 


3300 


tgcagggggc 


cagggagtca 


agaatatacc 


gtgttcctcc 


3360 


cttcccgaga 


gcataaccaa 


cttgacaggg 


gtgcccagtt 


3420 


gagatccttc 


ccccatcccc 


aggagtgctc 


tcaaccagcc 


3480 


gaccctttgc 


ccaccagggt 


taattcctgg 


tggggcagct 


3540 


aaacctatag 


gaagccttcc 


agtgtgagct 


ggaattagac 


3600 


cctctcccta 


gacgcagttg 


cggggcactc 


cagggaatga 


3660 


aacagcagcc 


tggagctacc 


cccaatccct 


cacagcctga 


3720 


gcatttgtag 


aatcggggca 


aatttcctta 


tttatttatg 


3780 


tcatcctgat 


cccgttttgc 


tttgaagaga 


ccccagtaac 


3840 


ccaaaaccat 


gcctggatct 


cccatagctt 


ctcctttgct 


3900 


aaaaaaaata 


tcttctggct 


tcttgtgtgt 


acagagacaa 


3960 


ggaatctttt 


ctgcaccaaa 


gctgcttcta 


aagcagaaag 


4020 


catgctgcct 


tatttatatt 


aaaggaagaa 


ttaaatcttg 


4080 


tgtttggttt 


ttaaacttca 


ggaaatctgt 


gaggctccca 


4140 


gggtggattt 


cctattgaca 


aagcagaagc 


tttcactccc 


4200 


ggagacttta 


aaaaataaca 


aaacagaaca 


tttctaccct 


4260 


gctgcccgca 


gggctccaga 


aagggccttt 


gaaagctggc 


4320 


aagaccttat 


ttcacagagc 


taatgtcagc 


gagagatata 


4380 


cgtttggttt 


tagagaaaga 


ggaggtgtct 


tggataactc 


4440 
4441 



<210> 116 
<211> 120
<212> DNA 
<213> Homo sapien 



<400> 116 

acttacaaac tgtggtcatg 



ttaagataca gaacatcagc caagaaccag aaccaactcc 



60 



ttaacataac atagagtggg 



ccctgcccac cccccaatac aatacattta agataagctt 



120 



<210> 117 
<211> 1977 
<212> DNA 
<213> Homo sapien 








<400> 117 
ttcacccggc 


cctcgcctgt 


caccttcaca gagaaggggc tgcctgcagg 


aagaaagcac 


60 


ggcccacgcc 


cctccagtca 


catactgcct gtgggccctg gtgtagtgag 


gggggctgcc 


120 


gaggcactgc 


tgcactcacc 


aggcacgtgc tggtcggcaa acttgatgtt 


gatgatgtag 


180 


ttgcctggct 


ctgtggggca 


gtaggtcacc tggcgggaca ggtgcctggc


tgctacaaac 


240 


catgcaatga 


gccatgcccc 


gccctggaca cccccgccca gcatctgggc 


ctccacgctt 


300 


gggaccgtgg 


gagcggccaa 


cagagctatg tctggagaca tatgataaac 


cacctcagcc 


360 


cccaccaagc 


cgccgcaccc 


gtagaccaga ccccaaggac cctggccacc 


atgggccaga 


420 


gagcattacc 


ttcatctctg 


gctctgctga gccggccctt gagtccccca 


cctgctgcct 


480 


gctctggcga 


ccctgggtgt 


gggagtggtg ccgggctgcc ttctgcttcc 


gcccgctgcc 


540 


gggattgcct 


ccagcgctgt 


ggaggcctgt gtgcggggat gcagcccctg


cctgtctact 


600 


gaggactccc 


actgagggga 


ctgctgaagc caactggtgc caaggagcac 


aatggagtgc 


660 


cccccagccc 


tgatcgtgca 


ccccccaaac cggcgggatg gccaggcggg


ctgcaagtca 


720 


accatgggca 


gcagcttcag 


ctaccccgat gttaagctca aaggcatccc 


tgtgtatccc 


780 


tacccgagag 


gccacctccc 


cagcccctga tgcggactcc tgctgcaagg 


agccactggc 


840 


cgatccccca 


cccatgcgag 


cacagcctgc ccagcacctt tgccagtagt 


cctcgtggct 


900 


ccgaggagta 


ctattctttc 


catgagtcgg acctggacct gccggagatg 


ggcagtggct 


960 


ccatgtcgag 


ccgagaaatt 


gatgtgctca tcttcaagaa gctgacagag 


gctgttcagc 


1020 


gtacaccaga 


tcgatgagct 


ggccaagtgc acatcagaca ctgtgttcct 


ggagaagacc 


1080 


agtaagatct 


cggaccttat 


cagcagcatc acgcaggact accacctgga 


tgagcaggat 


1140 


gctgagggcc 


gcctggtacg 


cggcatcatt cgcattagta cccgaaagag 


ccgtgctcgc 


1200 


ccacagacct 


cggagggtcg 


ttcaactcgg gctgctgccc caaccgctgc 


tgcccctgac 


1260 


agtggccatg 


agaccatggt 


gggctcaggt ctcagccagg atgagctgac 


agtgcagatc 


1320
tcccaggaga


cgactgcaga 


tgccatcgcc 


cggaagctga 


ggccttatgg 


agctccaggg 


1380 


tacccagcaa 


gccatgactc 


atccttccag 


ggcaccgaca 


cagactcgtc 


gggggcaccc 


1440 


ttgctccagg 


tgtactgcta 


acccctgcca 


ggcccagctg 


ccacaccctt 


tctgggagaa 


1500 


gcatggccta 


cagaatgaag 


agggggacca 


ggaacccctg 


tgggagaggc 


ttagacctga 


1560 


agcagtgccc 


actctggctc 


ctcctgcctt 


ggctgactgg 


gttcctggac 


catgtgcatt 


1620 


tcactgggcc 


atgggatcta 


catctccttg 


catccccagc 


tggtctgatc 


cctgccaggg 


1680 


ccccttcctt 


cctgctcatg 


gtcttcaggt 


ggcctgatca 


tggaaagtaa 


ggagttaggc 


1740 


attaccttct 


gggagtgaac 


cctgactcca 


tccccctatt 


gccaccctaa 


ccaatcatgc 


1800 


aaacttctcc 


ctccctgggg 


taattcaaca 


gttaaaagaa 


gcttatctta 


aatgtattgt 


1860 


attggggggt 


gggcagggcc 


cactctatgt 


tatgttaagg 


agttggttct 


ggttcttggc 


1920 


tgatgttctg 


tatcttaaca 


tgaccacagt 


ttgtaagtac 


ctcgctcgcg 


accacgc 


1977 


<210> 118 
<211> 182 
<212> DNA 
<213> Homo sapien 












<400> 118 
catgttctct 


aatgcatgct 


ccgacggcgc 


atgtgatgga 


tcgcggcgag 


tgaaaagcaa 


60 


gagccagaat 


taagaggttg 


ggtcagtctg 


gcagtgagtt 


catgcattta 


gaggtgttct 


120 


tcaagatgac 


taatgttcaa 


aaattgagac 


atctgttgcg 


gttctttttt 


tttttttttc 


180 


cc 












182 


<210> 119 
<211> 875 
<212> DNA 
<213> Homo sapien 












<400> 119 
ggtcgcggcc 


gaggtaccac 


attggtccac 


ttgacactaa 


ccaatcgatc 


attttttttt 


60 


aatcaagaaa gctagattct 


atcagataaa 


atcactgctt 


ctaaagagtt 


taaatctagt 


120 


tagaaaaagt 


tatagaaatg 


tttgcaaaga 


taagtaacag 


atagagtcag 


tagaggataa 


180 


gatcaaaaac 


aaaaccaagc 


aaaagatgag 


ttcaggggag 


tttgccatca 


agtggcaaaa 


240 


ctgacttact 


tagggaagaa 


agttataaaa 


caggaaaata 


tgagatgaac 


cttgagtgat 


300 


gtggaagatt 


tagataaatg 


gaaaggaagg 


agaaaatgga 


gttctttagg 


tggttgtaat 


360 


tggaggagga aatgaataca 


cacatcttgt 


tgacttaaac 


ccagacattc 


agcagctctc 


420
tatacatatc


tggaaaagac 


tgcacagtca 


cctcctgtct 


ctcaccccag 


gtattactta 


480


gaattattat 


catatttccc 


ttcctttaaa 


gtaagtaagg 


gtgatggtga 


caatatggag 


540


aactatgatt 


tttccattaa 


cctaataata 


attgtattta 


ttgagttctg 


ttaagcattt 


600


tacatattaa 


ctcacttaag 


cctttcaaca 


gcctgggcaa 


aataggtatt 


attatccccc 


660


attttacagg 


caaagaaaac 


tgaggtttaa 


ggtaactgtg 


ccgaaagtgc 


catataacag 


720 


ggctcacatt 


cagtatctgc 


agttgcaagc 


tcatgatcta 


tagtgccaag 


ttgcatatgg 


780 


tagtccatgt 


cacattatta 


cccttttata 


tccctggaat 


tttcatgggc 


aaccattagt 


840 


attcatttta 


atatcactaa 


acttccagcc 


ctgat 






875


<210> 120 
<211> 987 
<212> DNA 
<213> Homo sapien 












<400> 120 
ggtcgcggcc 


gaggtaccac 


attggtccac 


ttgacactaa 


ccaatcgatc 


attttttttt 


60 


aatcaagaaa 


gctagattct 


atcagataaa 


atcactgctt 


ctaaagagtt 


taaatctagt 


120


tagaaaaagt 


tatagaaatg 


tttgcaaaga 


taagtaacag 


atagagtcag 


tagaggataa 


180


gatcaaaaac 


aaaaccaagc 


aaaagatgag 


ttcaggggag 


tttgccatca 


agtggcaaaa 


240


ctgacttact 


tagggaagaa 


agttataaaa 


caggaaaata 


tgagatgaac 


cttgagtgat 


300 


gtggaagatt 


tagataaatg 


gaaaggaagg 


agaaaatgga 


gttctttagg 


tggttgtaat 


360 


tggaggagga 


aatgaataca 


cacatcttgt 


tgacttaaac 


ccagacattc 


agcagctctc 


420


tatacatatc 


tggaaaagac 


tgcacagtca 


cctcctgtct 


ctcaccccag 


gtattactta 


480


gaattattat 


catatttccc 


ttcctttaaa 


gtaagtaagg 


gtgatggtga 


caatatggag 


540


aactatgatt 


tttccattaa 


cctaataata 


attgtattta 


ttgagttctg 


ttaagcattt 


600


tacatattaa 


ctcacttaag 


cctttcaaca 


gcctgggcaa 


aataggtatt 


attatccccc 


660 


attttacagg 


caaagaaaac 


tgaggtttaa 


ggtaactgtg 


ccgaaagtgc 


catataacag 


720 


ggctcacatt 


cagtatctgc 


agttgcaagc 


tcatgatcta 


tagtgccaag 


ttgcatatgg 


780 


tagtccatgt 


cacattatta 


ccccttttta 


tatccctgga 


atttttccat 


gggcaaccat 


840 


tagctatttc 


atttaataat 


cacctaaaac 


ttttcagtct 


tctgattaaa 


attacgctgg 


900 


agtgatagaa 


tgtattttca 


tgatagaaat 


tgggaaaaaa 


aatggggaat 


gaagtttatc 


960 


agcatttcag 


acttgttttt 


ttttttt 








987 



<210> 121 
<211> 295
<212> DNA
<213> Homo sapien



<400> 121 

cgtggtcgcg gcgaggtaca 



taaagctatg attttgcatg ggaactcatt taaattaaag 



60 



ttaccaaggg aattccatat 



gtaaactttt cctggaacta tgaacttctg agacttcaga 



120 



aagattttga ctgtgtaggt tattctgtgt tgcgctatgg ggtccgcctt tgtgttgctg 180 
agctggagag cgtgcctctg ctgccgcgct gtgtcagtag tggggattgc acttttgttt 240 



<210> 122 

<211> 3210 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (954)..(1013)

<223> a, c, g or t 



<220> 

<221> misc_feature

<222> (1958)..(2054)

<223> a, c, g or t 



<400> 122 
tagtgtaaat 


tctactatga 


tgagtatttt tatatgtaaa gctttctctg 


aatttaggat 


60 


ttttttaagc 


ctagatctca 


ggacattgaa agactatgga aaaatagaat 


acagttttaa 


120 


cattcttgac 


atagttgaca 


attgacactt gctttccagt gatgtttact 


agtacagttt 


180 


tcaataactg 


tttactgcat 


atattttatt tggaaaatag tgttgttgtt 


tttttccttt 


240 


ctggtaattt 


ctttctatac 


tttgcatctc ttatttttat tgagcatgga 


cttgagagac 


300 


tttaagaagt 


tttgtaggga 


aaatacacac cgagaaaata aaaactctag gaaaagttca 


360 


caatgtttta 


aaaatagttt 


tgtacataaa gctatgattt ttgcatggga 


actcatttaa 


420 


attaaagtta 


ccaagggaat 


tccatatgta aacttttcct tgaactatga acttctgaga 


480 


cttcagaaag 


atttgactgt 


gtaggttatt ctgtgttgcg ctatggggtc 


cgcctttgtg 


540 


ttgctgagct 


ggagagcgtg 


cctctgctgc cgcgctgtgt cagtagtggg 


gattgcactt 


600 


ttgtttccag 


ctacaggcca 


aatttgagca ccttaagaga cttcaccaag aagagagaat


660 


gaagcttgaa 


gaaaggagag 


acttttggaa gaagaaataa ttgctttctc 


taaaaagaaa 


720 


gctacctccg 


agatatttca 


cagccagtcc tttctggcaa caggcagcaa 


cctcaggaag 


780 



ccagctacag gccaaatttg 



agcaccttga gagacttcac caagaagaga gaatg 



295
gacaaggacc


gtaagaggta 


agaggcccag 


ccaccctcag 


gttgctgccc 


tgcccacact 


840 


tgtcacattg 


ccctgtcgga 


tcttcctagc 


gcttcttact 


tctgcttcct 


tatgctttgt 


900 


cttccttctc 


caactaaatg 


tcagctcctg 


gagaggagag 


accttgtccc 


tctnnnnnnn 


960 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnntttgcca 


1020 


aaatgaatga 


atgaacaaaa 


taaaaataca 


tcacagtgtg 


aaaaggatca 


gttagctggg 


1080 


ggaaaatggc 


tgctcttatt 


gaatagccca 


gctgtggccc 


tagtacacag 


ctaatggctg 


1140 


ctgacctacc 


atgagttatt 


ttgaatttca 


tgtctaaata 


aagctgtgcc 


ctttgttggg 


1200 


gaattataga 


cataagatag 


tattttattg 


aattaattct 


taggttcagt 


tttttagaac 


1260 


aaattctact 


gatactgatt 


ttgtgagaat 


ttttatttta 


ataagggaac 


caggctgtcg 


1320 


attcgaactc 


ctgtgcattg 


atgtcagagc 


ttgtgaaacc 


aatggtggac 


gtaaagacgc 


1380 


agagaaaggt 


agagagggcc 


acagtgcaag 


ggggcagcca 


tgagtagcat 


gtggatgata 


1440 


atcgcaaggc 


gtgtaggttt 


ggggtatgtg 


ttaaaatcag 


ttacccaagt 


tttagaaaat 


1500 


aagtttattt 


ttcctttttg 


ataatttcct 


tctgtacttt 


gtgtcccttg 


ttttatttct 


1560 


cccacttaag 


tatccctgtc 


ggtttaacta 


tttggctctt 


gaggaaattc 


atcatgcctt 


1620 


gttctattca 


gaatgctcat 


ttgggggact 


ctgcctcagt 


ttactctagc 


tacctctgtg 


1680 


ctagcagcgt 


tgcatgctcc 


catgttcccc 


acagagcaac 


acatctggtt 


taatgctctc 


1740 


actgcatcta 


tttcagaccc 


attttcagaa 


tattgcaact 


caactttaaa 


ataggaaaac 


1800 


ataaccttgg 


cccattggtg 


gctttcaatt 


tttaaatatg 


gagaagtaaa 


ccacaggact 


1860 


ttggcagatt 


ttattggctt 


atatatttta 


caggtatttt 


tgtttgtttg 


ttatgttttt 


1920 


ttggaaaaga 


ggaaggaatt 


ctcatgtatg 


atctttcnnn 


nnnnnnnnnn 


nnnnnnnnnn 


1980 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


2040 


nnnnnnnnnn 


nnnnttattt 


ttgaaagaat 


aaatagtata 


aaagctgtat 


tttattcaag 


2100 


cattgaattt 


agaaagataa 


actataaatt 


tattactgct 


tttcaacatg 


ctttggattt 


2160 


taaagctaat 


ggatctttta 


ttaatacctt 


ttttcctttc 


atcttaaccc 


agtgttacta 


2220 


aaattagatt 


tcccattttt 


tcccctatat 


gaaagataat 


ttacatttac 


cttgtaaaaa 


2280 


ttatcaccct 


gctccacttg 


agaaccctgc 


tgttgtttgc 


aaaatcagga 


ccaagcccat 


2340 


attttttgag 


agctctctag 


aagaatttat 


cttcaagaaa 


atatggtttg 


gtgtttttgg 


2400 


ttttttcttt 


tctttttcaa 


agtcccatgg 


ccagatatcc 


tataatatac 


tagatgcatg 


2460 


tttgctaatt 


tttacttgaa 


ttttttttaa 


attgtaccaa 


tcaaaaggtt 


ctttttcttt 


2520
ccagctccaa


ttttttgtaa 


aacagaagtt


ccagagcaca 


gaaggccacc 


atcacaagca 


2580 


aactttatta 


aaaaaaaact 


agaagtgtgc 


tttgatxtcg 


CCgtCatt uy 


ttttatcact 


2640 


tctatatttg 


gtgaacagcc 


acagttactg 


atauttatgg 


aaaagtact. t. 


tcaagtacaa 


2700 


ggtcaataca 


taagccagag 


tgaatgatac 


tacaagttga 


gcatctctaa


ttcaaaaatc 


2760 


tgaaatccag 


aagcttcaaa 


atctgaatct 


ttttgagcac 


tgacttgacc 


ccacaagtgg 


2820 


aaaattcccc 


acccgacacc 


tttgctttct 


gatggttcag 


tttaaacaga 


ttttgtttct 


2880 


tgcacaaaat 


ttttgtataa 


attactttca 


ggctatatgt 


ataaggtgga 


tgtgaaacat 


2940 


gaattatgta 


attagagtcg 


ggtcccgtag 


tgtatatgca 


gatattccaa 


acctgaaatc 


3000 


caaaacactt 


ctggtcccta 


agcattttgc 


actaagggat 


actcagcttg 


tacctatata 


3060 


ttcctatata 


ttcactgttg 


ttagaaatgt 


ttaagttgct 


gttctgtgat 


gaatctaaat 


3120 


cttttctctt 


gctaccaagc 


tattgtcact 


gcagtgcatt 


ataccaaaga 


gcgaagtcag 


3180 


tgccactgaa 


aatacagaac 


cattatatcg 








3210 


<210> 123 
<211> 662 
<212> DNA 
<213> Homo sapien 












<400> 123 
gcgtggtcgc 


ggcgaggtac 


ctgcatctaa 


aatagaattt 


atttttcttg 


agacacgtct 


60 


ttcaggcaga 


ggaaaggtaa 


tgggcacaca 


aaccttgatc 


atcttcttac


aaaacaaggc 


120 


agcatccaca 


tttcaatact 


gtagcatatg 


aaataatttc 


tcaattccta 


ctctgtcata 


180 


gcttaaatga 


cttactgatt 


aaaatatgag 


taacaataat 


tctggagctg 


aaagacattc 


240 


tcaagccaaa 


aggtaacatt 


aaacaatgtt 


ttactttaaa 


tttaatgtca 


tgtagggtca 


300 


ctaaaaacaa 


aatgtaaggg 


ataaatgtaa 


gtgcataaaa 


cagagataat 


accaatttcc 


360 


acaaatacaa 


cttttattat 


ccaaaatcat 


cttggaagga 


ctttttctaa 


tatgcccatt 


420 


ttctaaataa 


gattaaccat 


ttgatgggaa 


tatttccaat 


tgtatcacct 




480 


ctttttcttc 


atcaacttag 


ggcctgggtt 


gtaggcactg 


tggctctttg 


ggatagttaa 


540 


atgggtgcaa 


tttggctgag 


accgcccctg 


tcttaaatgg 


cccaggctac 


agggcttgcc 


600 


aaatggcttg 


tggtttgcaa 


cctcttcctt 


tgatatccat 


tgaagaacac 


atgtcctacc 


660 



ct 662 



<210> 124 
<211> 1845 
<212> DNA 
<213> Homo sapien




<400> 124 
tgtcagattg 


atggcctgtg 


ctgttgagta 


gtggtcgggt 


gtggtgagtg 


gctgggtggg 


ggtttgtgtg 


tgcgtgtgtc 


gcggcggtgt 


cgttgttaag 


aggctcgggt


gtcggcgggg 


ttcggtggtc 


cggcgtggtt 


gtgtggatat


ggtgcgggcg 


tggcttatgt 


tgtgttttaa 


gcggtgtggt 


ggtgtgtatg 


tgtggtgttt


ggccgcggta 


ggagtccggc


catctcctga 


attgatatca 


attcttagaa 


taaaatgcag 


ttgagacacg 


tctttcaggc 


agaggaaagg


tataaaacaa 


ggcagcatcc 


acatttcaat 


ctactctgtc 


atagcttaaa 


tgacttactg 


ctgaaagaca 


ttctcaagcc 


aaaaggtaac 


tcatgtaggg 


tcactaaaaa 


caaaatgtaa 


aataccaatt 


tccacaaata 


caacttttat 


taatatgccc 


attttctaaa 


taagattaac 


cctccttcct 


tgactttttc 


ttcatcaact 


ttgggatagt 


taaatgggtg 


caatttggct 


tacagggctt 


gccaaatggc


tttggtttgc 


acatgtccta 


ccctcttgat 


tgcaagcttc


gataagttct 


aaggtccttc 


cagttccaaa 


agagattgga 


gattactgag 


gaatctcttc 


era t* t* c t a t" era 




aaaattgaaa 


gaagttttaa 


aaaataatgg 


agtgggtaga 


acacgtgtca 


gttatcacct 


ttataagtat 


aaccctaaac 


taagtgagat 


attactgggt 


cctttaaaga 


tggaagaatc 


tcaaaagtat 


taatgacacg 


tctgatcatc 


accctgagtt 



PCT/US01/43612 




gttccgtctg 


attgacgtgg 


cgattcggat


60 


ctttcggtgg 


gtggtgtgtg 


tgcggccggt 


120 


gggctgtggg 


tcggacggag


ggcctggtgg 


180 


gtgtgttctg 


ttgggggctc 


gtcgtgtccg 


240 


cgtggtgtcg 


cgtggttgcg 


tggctgcgct


300 


tgcgttgctc 


gtggggtcgg 


tggggtcgtg 


360 


gttcgcatgg 


ggcgtgttgt 


atgtgttggt 


420 


tttattcttt 


tacagttttc 


tatgttttga 


480 


tacctgcatc 


taaaatagaa 


tttatttttc 


540 


taatgggcac 


acaaaccttg 


atcatcttct 


600 


actgtagcat 


atgaaataat 


ttctcaattc 


660 


attaaaatat 


gagtaacaat 


aattctggag 


720 


attaaacaat 


gttttacttt 


aaatttaatg 


780 


gggataaatg 


taagtgcata 


aaacagagat 


840 


tatccaaaat 


catcttggaa 


ggactttttc 


900 


catttgatgg 


gaatatttcc 


aattgtatca 


960 


tagggcctgg


gttgtaggca 


ctgtggctct 


1020 


gagaccgccc 


ctgtcttaaa 


tggcccaggc 


1080 


aacctcttcc 


tttgatatcc 


attgaagaac 


1140 


tatctgctat


tggctatcag 


gagcataaca 


1200 


agccattgac


caaccctgtg 


aggcactatg 


1260 


caataacata 


tgatatataa 


aggtaattgt 


1320 


acaaaaccaa 


tcttgtttaa 


actattaaat 


1380 


tgtattgcct


atcaaattaa 


aattcaggga 


1440 


gatctataag 


gttgaattaa 


ctaaagatct 


1500 


tactaaatat 


agatgaatgg 


cttccttttt 


1560 


gcctgaatca


gaagcagagg 


actcatgttc 


1620 


ccataaagct 


atagcatatt 


gtagtagaaa 


1680
gtatatcagc


atgtgagtgg 


agagatttaa 


gggtatagat 


ctctacgtat ttccagctct 


1740 


gtcaccaact ggtgacatga 


ctttgggcag 


aaactgtctt 


attgattccc tcgtgccgaa 


1800 


ttcttgctcg 


aggccaaatt 


ccctatagcg 


aacttattga 


acaga 


1845 


<210> 125 
<211> 306 
<212> DNA 
<213> Homo sapien 










<400> 125 

gcggcgccgg gcaggtacat 


aagtgtcact 


cgcctaatta 


aaaacattga gtaaaccaag 


60 


tttttatata gactaccctt 


gccatatgat 


gctctttttc 


tctaataata tgcagtttaa 


120 


atcctgagga 


atcaatgccc 


agcatttcac 


cacatctgaa 


ctctgtgtgg gcattcttca 


180 


ctcgcctaca 


aggggtaaac 


aaggctacca 


gaacttgaat 


ttgacttata gggagctacc 


240 


caggaagggg 


aaagcccttg 


ggactttttc 


caaaacaatc 


ttctatttga actgttcatc 


300 


agccaa 










306 


<210> 126 

<211> 2049 

<212> DNA 

<213> Homo sapien 










<400> 126 
aaaaaaaaaa 


gtaaaacaaa 


aataaagtct 


atgcccatta 


agacgtcttc taattcagtt 


60 


gtgattgtct 


gctcctactt 


aaaaaaatat 


ttaagcttga 


tgtttaatta ttccctttca 


120 


gcaaatttgg 


atcagaaaat 


taaagtatgt 


gacaagatca 


ggtcaccttg aatttccaca 


180 


caatctcaag 


acactgaata 


gcaaaaaagt 


aacattacac 


agtaatgatt aggatatttc 


240 


cttagacttt 


gctggatctt 


tggtcttaag 


gtaacatgta 


aaagtagtga agcctttcct 


300 


ttcatggccc 


tgtgcaatgt 


aacggttttc 


tgcctcctct 


tcagctggaa gcgttagtgg 


360 


tagtatgggc 


acagaatata 


tgtacactgg 


cgatgctgac 


catgcctccc aggtaccctg 


420 


gctctgggtt 


ccttgaccta 


gggaacaaga 


ttggatgagg 


cagatctttg agcccatgtg 


480 


actatagaat 


ttgctgatga 


tataatttta 


caataacaat 


ggataggaat tttacctctc 


540 


tttttattag 


tttaatatta 


tttaatatta 


tgtacataag 


tgttcactcg cctaattaaa 


600 


aacattgagt 


aaaccaagtt 


tttatataga 


ctacccttgc 


catatgatgc tctttttctc 


660 


taataatatg 


cagtttaaat 


cctgaggaat 


caatgcccag 


catttcacca catctgaact 


720 


ctgtgtgggc 


attcttcact 


cgcctacaag 


gggtaaacaa 


ggctaccaga acttgaattt 


780 


gacttatagg 


gagctaccca 


ggaaggggaa 


agcccttggg 


actttttcca aaacaatctt 


840
ctatttgaac


tgttcatcag 


ccaaagtagt 


aaagatggga 


agataaaggt 


aacactggcc 


actgaataaa 


cacagcctag 


gtggcctggg 


tagttcctgg 


tctaccttct 


gacccttttt 


catttactta 


caagttccca 


gcttttaggg 


tgaattagat 


atacatctta 


gaagtaatga 


cacaaaccag 


aattctttct 


tgttgaacaa 


cacttttctc 


tccctctctt 


ctcctcttcc 


gaaatacctc 


cagaggcgtt 


ttattgtgtt 


aattttgctc 


aggcaaccaa 


ggacagagta 


gccacctgta 


cataaaagtg 


taattattta 


ctttgtacag 


tgttttaagt 


tctgttttaa 


gtaataaaat 


attcattccg 


catgcaaaaa 


acaaacagaa 


ccaaaagata 


gaaaccaaac 


aaacaaccac 


acaaacagca 


cacacaaccg 


acagcacaac 


aagggacaaa 


acacacacgt 


atctaaagac 


aggcgacaac 


aacatagcag 


ctatccagcg 


gagatcagat 


cacagaagcc 


atgacaccgc 


agggcggata 


aaaacactcg 


ctaacaaaca 


agctagtaga 


tcacaagtac 



aagagggag 

<210> 127 

<211> 286 

<212> DNA 

<213> Homo sapien 

<400> 127 

acatctggtt gcggagaaag gaaaaacttc 
gttgaatgga atttttactg ttagaggtag 
tccctacaag aggattacgt ggtaagttga 
tcaaaactat atacgttggg cacagaactg 
tcccctttgt gtccccagtg cctaaagcag
ccactgaggt


gacaaagctt 


tcagaaatac 


900 


cacttggggc 


tttgacattg 


gattgggtgg 


960 


cttgagcctc 


acttacttct 


ccttgataca 


1020 


ctaaaatagc 


cagtgtctat 


ttcactaggc 


1080 


aaaaaagagg 


gaggggggag 


catctagttt 


1140 


gctattggca 


gctgttaaat 


cagattcagc 


1200 


gaccaatgag 


ttagatgact 


ttaataattc 


1260 


tgaaatcaga 


gagatgagaa 


actactcttt 


1320 


cctttcccct 


ccaagcagct 


ccctttatac 


1380 


tcggcagaaa 


catggagtgc 


ttttgtatag 


1440 


tttaattttc 


ccatttgtat 


catattaaag 


1500 


aattattttg 


tattttattt 


ttataaccta 


1560 


aaaacacaca 


cacaacccaa 


acaaccaaaa 


1620 


acagcaacag 


acagacacaa 


gagaagacga 


1680 


cagcggagaa 


ccaacaaagc 


caaacgcaag 


1740 


agcacaaagc 


agccgcagaa 


ccgaacacac 


1800 


gccgtagcac 


cgacaaccac 


taaaccataa 


1860 


gacacagaaa 


gaaaagcatg 


tatgcaacgc 


1920 


taggcgggac 


cgccgcagaa 


cgatcgaagg 


1980 


gtgaccacgc 


cctcttatct 


ataaaggagg 


2040 








2049 


tcaaagaaga 


atttgaagtg 


agctttaagg 


60 


agaaatacaa 


tatcataaag 


aaagatgtga 


120 


aagacattaa 


acagtccaac 


ctggtcattg 


180 


aagaccaggt 


gtcaaaggag 


gatggctctg 


240 


tgtttggtgc 


aagctt 




286
<210> 128

<211> 12421 

<212> DNA 

<213> Homo sapien 

<400> 128 



agcccacggg 


atcgcccacg 


cgtcggggca ggtcaaggag gaggctgaaa 


gagcctgagc 


60 


tgtgccctct 


ccattccact 


gctgtggcag ggtcagaaat cttggataga 


gaaaaccttt 


120 


tgcaaacggg 


aatgtatctt 


tgtaattcct agcacgaaag actctaacag 


gtgttgctgt 


180 


ggccagttca 


ccaaccagca 


tatcccccct ctgccaagtg caacacccag 


caaaaatgaa 


240 


gaggaaaaca 


aacaggtgga 


gactcagcct gagaaatggt ctgttgccaa 


gcacaccaca 


300 


gagctaccca 


acagattcct 


atggagttct tgaattccag ggtggcggat 


attccaataa 


360 


agccatgtat 


atccgtgtat 


cctatgacac caagccagac tcactgctcc 


atctcatggt 


420 


gaaagattgg 


cagctggaac 


tccccaagct cttaatatct gtgcatggag 


gcctccagaa 


480 


ctttgagatg 


cagcccaagc 


tgaaacaagt ctttgggaaa ggcctgatca 


aggctgctat 


540 


gaccaccggg 


gcctggatct 


tcaccggggg tgtcagcaca ggtgttatca 


gccacgtagg 


600 


ggatgccttg


aaagaccact 


cctccaagtc cagaggccgg gtttgtgcta 


taggaattgc 


660 


tccatggggc 



atcgtggaga 


ataaggaaga cctggttgga aaggatgtaa 


caagagtgta 


720 


ccagaccatg 


tccaaccctc 


taagtaagct ctctgtgctc aacaactccc 


acacccactt 


780 


catcctggct 


gacaatggca 


ccctgggcaa gtatggcgcc gaggtgaagc 


tgcgaaggct 


840 


gctggaaaag 


cacatctccc 


tccagaagat caacacaaga ctggggcagg 


gcgtgcccct 


900 


cgtgggtctc


gtggtggagg


ggggccctaa cgtggtgtcc atcgtcttgg 


aatacctgca 


960 


agaagagcct 


cccatccctg 


tggtgatttg tgatggcagc ggacgtgcct 


cggacatcct 


1020 


gtcctttgcg 


cacaagtact 


gtgaagaagg cggaataata aatgagtccc 


tcagggagca 


1080 


gcttctagtt 


accattcaga 


aaacatttaa ttataataag gcacaatcac 


atcagctgtt 


1140 


tgcaattata 


atggagtgca 


tgaagaagaa agaactcgtc actgtgttca 


gaatgggttc 


1200 


tgagggccag 


caggacatcg 


agatggcaat tttaactgcc ctgctgaaag 


gaacaaacgt 


1260 


atctgctcca 


gatcagctga 


gcttggcact ggcttggaac cgcgtggaca 


tagcacgaag 


1320 


ccagatcttt 


gtctttgggc 


cccactggac gcccctggga agcctggcac 


ccccgacgga 


1380 


cagcaaagcc 


acggagaagg 


agaagaagcc acccatggcc accaccaagg 


gaggaagagg 


1440 


aaaagggaaa 


ggcaagaaga 


aagggaaagt gaaagaggaa gtggaggaag 


aaactgaccc 


1500 


ccggaagata 


gagctgctga 


actgggtgaa tgctttggag caagcgatgc 


tagatgcttt 


1560
agtcttagat


cgtgtcgact 


ttgtgaagct 


cctgattgaa 


aacggagtga 


acatgcaaca 


1620 


ctttctgacc 


attccgaggc 


tggaggagct 


ctataacaca 


agactgggtc 


caccaaacac 


1680 


acttcatctg 


ctggtgaggg 


atgtgaaaaa 


gagcaacctt 


ccgcctgatt 


accacatcag 


1740 


cctcatagac 


atcgggctcg 


tgctggagta 


cctcatggga 


ggagcctacc 


gctgcaacta 


1800 


cactcggaaa 


aactttcgga 


ccctttacaa 


caacttgttt 


ggaccaaaga 


ggcctaaagc 


1860 


tcttaaactt 


ctgggaatgg 


aagatgatga 


gcctccagct 


aaagggaaga 


aaaaaaaaaa 


1920 


aaagaaaaag 


gaggaagaga 


tcgacattga 


tgtggacgac 


cctgccgtga 


gtcggttcca 


1980 


gtatcccttc 


cacgagctga 


tggtgtgggc 


agtgctgatg 


aaacgccaga 


aaatggcagt 


2040 


gttcctctgg 


cagcgagggg 


aagagagcat 


ggccaaggcc 


ctggtggcct 


gcaagctcta 


2100 


caaggccatg 


gcccacgagt 


cctccgagag 


tgatctggtg 


gatgacatct 


cccaggactt 


2160 


ggataacaat 


tccaaagact 


tcggccagct 


tgctttggag 


ttattagacc 


agtcctataa 


2220 


gcatgacgag 


cagatcgcta 


tgaaactcct 


gacctacgag 


ctgaaaaact 


ggagcaactc 


2280 


gacctgcctc 


aaactggccg 


tggcagccaa 


acaccgggac 


ttcattgctc 


acacctgcag 


2340 


ccagatgctg 


ctgaccgata 


tgtggatggg 


aagactgcgg 


atgcggaaga 


accccggcct 


2400 


gaaggttatc 


atggggattc 


ttctaccccc 


caccatcttg 


tttttggaat 


ttcgcacata 


2460 


tgatgatttc 


tcgtatcaaa 


catccaagga 


aaacgaggat 


ggcaaagaaa 


aagaagagga 


2520 


aaatacggat 


gcaaatgcag 


atgctggctc 


aagaaagggg 


gatgaggaga 


acgagcataa 


2580 


aaaacagaga 


agtattccca 


tcggaacaaa 


gatctgtgaa 


ttctataacg 


cgcccattgt 


2640 


caagttctgg 


ttttacacaa 


tatcatactt 


gggctacctg 


ctgctgttta 


actacgtcat 


2700 


cctggtgcgg 


atggatggct 


ggccgtccct 


ccaggagtgg 


atcgtcatct 


cctacatcgt 


2760 


gagcctggcg 


ttagagaaga 


tacgagagat 


cctcatgtca 


gaaccaggca 


aactcagcca 


2820 


gaaaatcaaa 


gtttggcttc 


aggagtactg 


gaacatcaca 


gatctcgtgg 


ccatttccac 


2880 


attcatgatt 


ggagcaattc 


ttcgcctaca 


gaaccagccc 


tacatgggct 


atggccgggt 


2940 


gatctactgt 


gtggatatca 


tcttctggta 


catccgtgtc 


ctggacatct 


ttggtgtcaa 


3000 


caagtatctg 


gggccatacg 


tgatgatgat 


tggaaagatg 


atgatcgaca 


tgctgtactt 


3060 


tgtggtcatc 


atgctggtcg 


tgctcatgag 


tttcggagta 


gcccgtcaag 


ccattctgca 


3120 


tccagaggag 


aagccctctt 


ggaaactggc 


ccgaaacatc 


ttctacatgc 


cctactggat 


3180 


gatctatgga 


gaggtgtttg 


cagaccagat 


agacctctac 


gccatggaaa 


ttaatcctcc 


3240 


ttgtggtgag 


aacctatatg 


atgaggaggg 


caagcggctt 


cctccctgta 


tccccggcgc 


3300
ctggctcact


ccagcactca 


tggcgtgcta 


tctactggtc 


gccaacatcc 


tgctggtgaa 


3360 


cctgctgatt 


gctgtgttca 


acaatacctt 


ctttgaagta 


aaatcaatat 


ccaaccaggt 


3420 


gtggaagttc 


cagcgatatc 


agctgattat 


gacatttcat 


gacaggccag 


tcctgccccc 


3480 


accgatgatc 


attttaagcc 


acatctacat 


catcattatg 


cgtctcagcg 


gccgctgcag 


3540 


gaaaaagaga 


gaaggggacc 


aagaggaacg 


ggatcgtgga 


ttgaagctct 


tccttagcga 


3600 


cgaggagcta 


aagaggctgc 


atgagttcga 


ggagcagtgc 


gtgcaggagc 


acttccggga 


3660 


gaaggaggat 


gagcagcagt 


cgtccagcga 


cgagcgcatc 


cgggtcactt 


ctgaaagagt 


3720 


tgaaaatatg 


tcaatgaggt 


tggaagaaat 


caatgaaaga 


gaaactttta 


tgaaaacttc 


3780 


cctgcagact 


gttgaccttc 


gacttgctca 


gctagaagaa 


ttatctaaca 


gaatggtgaa 


3840 


tgctcttgaa 


aatcttgcgg 


gaatcgacag 


gtctgacctg 


atccaggcac 


ggtcccgggc 


3900 


ttcttctgaa 


tgtgaggcaa 


cgtatcttct 


ccggcaaagc 


agcatcaata 


gcgctgatgg 


3960 


ctacagcttg tatcgatatc 


attttaacgg 


agaagagtta 


ttatttgagg 


atacatctct 


4020 


ctccacgtca 


ccagggacag 


gagtcaggaa 


aaaaacctgt 


tccttccgta 


taaaggaaga 


4080 


gaaggacgtg aaaacgcacc 


tagtcccaga 


atgtcagaac 


agtcttcacc 


tttcactggg 


4140 


cacaagcaca 


tcagcaaccc 


cagatggcag 


tcaccttgca 


gtagatgact 


taaagaacgc 


4200 


tgaagagtca 


aaattaggtc 


cagatattgg 


gatttcaaag 


gaagatgatg 


aaagacagac 


4260 


agactctaaa 


aaagaagaaa 


ctatttcccc 


aagtttaaat 


aaaacagatg 


tgatacatgg 


4320 


acaggacaaa 


tcagatgttc 


aaaacactca 


gctaacagtg 


gaaacgacaa 


atatagaagg 


4380 


cactatttcc 


tatcccctgg 


aagaaaccaa 


aattacacgc 


tatttccccg 


atgaaacgat 


4440 


caatgcttgt 


aaaacaatga 


agtccagaag 


cttcgtctat 


tcccggggaa 


gaaagctggt 


4500 


cggtggggtt 


aaccaggatg 


tagagtacag 


ttcaatcacg 


gaccagcaat 


tgacgacgga 


4560 


atggcaatgc 


caagttcaaa 


agatcacgcg 


ctctcatagc 


acagatattc 


cttacattgt 


4620 


gtcggaagct gcagtgcaag 


ctgagcaaaa 


agagcagttt 


gcagatatgc 


aagatgaaca 


4680 


ccatgtcgct gaagcaattc 


ctcgaatccc 


tcgcttgtcc 


ctaaccatta 


ctgacagaaa 


4740 


tgggatggaa aacttactgt 


ctgtgaagcc 


agatcaaact 


ttgggattcc 


catctctcag 


4800 


gtcaaaaagt 


ttacatggac 


atcctaggaa 


tgtgaaatcc 


attcagggaa 


agttagacag 


4860 


atctggacat 


gccagtagtg 


taagcagctt 


agtaattgtg 


tctggaatga 


cagcagaaga 


4920 


aaaaaaggtt 


aagaaagaga 


aagcttccac 


agaaactgaa 


tgctagtctg 


ttttgtttct 


4980 


ttaatttttt 


ttttttaaca 


gtcataacca 


ctaatgggtg 


tcatcttggc 


catctaaaca 


5040 


tcatcaattt 


ctaaaaacat 


tttccttaaa 


aaattttgga 


aattcagact 


tgatttacaa 


5100
tttaatgcac


taaaagtagt 


attttgttag 


catatgttag 


taggcttagt 


tttttccagt 


5160 


tgcagtagta 


tcaaatgaaa 


gtgatgatac 


tgtaacgaag 


ataaattggc 


taatcagtat 


5220 


acaagattat 


acaatctctt 


tattactgag 


ggccaccaaa 


tagcctagga 


agtgccctcg 


5280 


agcactgaag 


tcaccattag 


gtcactcaag 


aagtaagcaa 


ctagctgggc 


tcagtggctt 


5340 


atgcctgtaa 


tcctagcact 


ttgggaggct 


gaggtaggcg 


gattgcttga 


ggccaggagt 


5400 


tcaagaccaa 


cacggccaat 


gtggcgaaac 


cccgtctcta 


ctgaaaaata 


caaaattagc 


5460 


cgggtgtggt 


ggtgggtgct 


tgtagtctca 


gctacttggg 


aggctgaagc 


aagagaattg 


5520 


cttgaatcca 


ggaggaggag 


gttgcagtga 


gcccagatca 


tgccactaca 


ctccagcctg 


5580 


ggtgacagag 


tgaggcactg 


tctcaaaaaa 


aaaaaaaaaa 


aaaaaaaaga 


tactgaacta 


5640 


ggcacttgac 


atccaagatg 


tccacttggc 


ccttttctaa 


gtgtacttca 


ctttgttttc 


5700 


attcctcctc 


taaagctttt 


taacaaactg 


tcactcatgt 


tctaaaaaca 


aacaaatgtc 


5760 


tttcatctca 


ttttttactc 


agcattatct 


atcaaaggga 


ctggatgtcc 


catctagttt 


5820 


gtactttagt 


ttctagctca 


ttacatagga 


aatgcaaata 


aaaaacacaa 


tgagatacca 


5880 


cttcacaccc 


actaagatgg 


ccataactta 


ttttttttaa 


aaaaaggaaa 


aggaaaataa 


5940 


taagtgttgg 


tgaggaagtg 


gacagaaatt 


ggaattctca 


tacattgctg 


gtgggaatgt 


6000 


ggaatggttc 


aactgctgtg 


gaaaacagtt 


tgtctgttcc 


tcaaaaagct 


aaacatagaa 


6060 


ttaacatatg 


acccagcagt 


tccactccta 


gatatatacc 


caagagaatg 


caaaataggc 


6120 


attcaaacaa 


atacaggtac 


acaaacattc 


atagcagcac 


tattcccaat 


agccaaaagt 


6180 


cagaaacaac 


ccaaagtcca 


tcaatgaatg 


agtggataaa 


taaattgtgc 


tataaacaca 


6240 


taatacaatt 


ttatttggcc 


ataaaaatga 


agtgttgata 


catgctacta 


tgtaaataaa 


6300 


tctcaaaaac 


ttgtactaag 


tgaaagaagc 


cagacacaaa 


aggtctcata 


caattcgatt 


6360 


tatataaaat 


agcaagaata 


gataaaacca 


tagagacaga 


aagcagattg 


gtggttgctg 


6420 


ggggctggag 


gaaggaagga 


atggagagta 


tctgcttaat 


gggatggggt 


ttccttttag 


6480 


aggagaacat 


tttttttgga 


actttaaata 


aaggtggtgg 


ttatgcaaca 


ttgttaacaa 


6540 


tactaagtgg 


cactgaactt 


ctcactttaa 


aatagttaat 


tttatgttat 


gtgaagttta 


6600 


cctcaataaa 


aaaattctta 


aaaaaaaaaa 


ctcagtgata 


tccaaccatt 


aaaaaatata 


6660 


agttacatag 


ctcgtatata 


atcaacttac 


ttattactgc 


actatgaggg 


gcacccagat 


6720 


atgtaagaca 


gagtcccaac 


ctttggttgt 


tggaggacta 


tctaaggata 


cagagattaa 


6780 


taaaaataaa 


agaactacca 


cgaaagaagt 


gccctccaag 


tacaaagaga 


gttcaaatat 


6840 






gtacatctgg 


ttgcggagaa 


aggaaaaact 


tctcaaagaa 


gaatttgaag 


tgagctttaa 


6900 


gggttgaatg 


gaatttttac 


tgttagaggt 


agagaaatac 


aatatcataa 


agaaagatgt 


6960 


gatccctaca 


agaggattac 


gtggtaagtt 


gaaagacatt 


aaacagtcca 


acctggtcat 


7020 


tgtcaaaact 


atatacgttg 


ggcacagaac 


tgaagaccag 


gtgtcaaagg 


aggatggctc 


7080 


tgtccccttt 


gtgtccccag 


tgcctaaagc 


agtgtttggt 


gcatgcttgg 


tgattggtaa 


7140 


atgtcaaaga 


aaaagataaa 


agataaaata 


tagccgggcg 


cggtggctca 


agcctgtaat 


7200 


cccagcactt 


tgggaggccg


agcttaacat 


tgagaaaaga 


aaggaaatgc 


aacaagaaaa 


7260 


gcagaaagca 


cttgatgtag 


aagcaagaaa 


gcaggttaac 


aggaagaaag 


ctttactgac 


7320 


tcgtgtccag 


gagattcttg 


acaatgttca 


ggttagaaaa 


gcacctaatg 


ccagtgattt 


7380 


tgatcagtgg 


gagatggaaa 


cagtttactc 


taattcagaa 


gtcagaaact 


tgaatgttcc 


7440 


tgctacattt 


ccaaatagct 


ttccaagcca 


tacggaacac 


tctactgcag 


caaagcttga 


7500 


taagatagct 


gggattttgc


cattggataa 


tgaggaccaa 


tgtaaaactg 


atggaataga 


7560 


cttagctaga 


gattcagaag 


gatttaattc 


tccgaagcaa 


tgtgatagtt 


ccaatattag 


7620 


tcatgtagaa 


aatgaagctt 


ttccaaagac 


ctcttcagca 


accccacaag 


aaactcttat 


7680 


ttctgatggt 


cccttctcag 


taaatgaaca 


acaggatcta 


ccacttttgg 


cagaagtcat 


7740 


cccagatccc 


tatgtaatga 


gtcttcagaa 


tctgatgaaa 


aagtcaaagg 


aatatataga 


7800 


aagagaacaa 


tctagacgca 


gtctgagagg


tagtatgaac 


agaattgtta 


atgagagtca 


7860 


tttagacaaa 


gaacatgatg 


ctgttgaagt 


ggctgactgt 


gtaaaagaga 


aaggccagtt 


7920 


gacaggcaaa 


cactgtgtct 


cagttattcc 


tgacaaacca 


agccttaata 


aatcaaatgt 


7980 


tcttctccaa 


ggtgcttcca 


ctcaagcaag 


cagcatgagt 


atgccagttt 


tagctagctt 


8040 


ttcgaaagtg 


gacataccta 


tacgaactgg 


ccatcccact 


gttctagagt 


ctaattctga 


8100 


ttttaaagtt 


attcccactt 


ttgttaccga 


aaataatgtt 


atcaaaagtc 


ttacaggttc 


8160 


atatgccaaa 


ttacctagtc 


cagagccaag 


tatgagtcct 


aaaatgcacc 


gaagacgttc 


8220 


caggacatca 


tcagcgtgtc


acatacttat 


aaataaccca 


ataaatgcct 


gtgaattaag 


8280 


ccctaaagga 


aaacraacacjQ 


caatggactt 


aattattcaa 


gatactgatg 


aaaacacaaa 


8340 


tgtgcccgaa 


attatgccaa 


agttaccaac 


tgatttagcg 


ggagtttgtt 


caagcaaggt 


8400 


ttatgtgggc 


aaaaatacat 


ctgaagtcaa 


agaagatgtg 


gttttaggta 


aatcaaatca 


8460 


ggtatgtcaa 


tcttcaggaa 


atcatttaga 


aaataaagtt 


actcatggac 


ttgttactgt 


8520 


ggaaggtcag 


ttaacatccg 


atgagagagg 


cgcacacata 


atgaacagta 


cctgtgctgc 


8580 


gatgccaaag 


ctgcatgaac 


catatgccag 


cagtcagtgt 


atagcaagtc 


caaactttgg 


8640 
aactgtgagt


ggactcaagc 


cagccagtat 


gttagagaaa 


aactgcagtt 


tgcaaacaga 


8700 


actgaataag 


tcttatgatg 


taaaaaaccc 


ttctccttta 


ttgatgcaaa 


accagaatac 


8760 


gagacagcag 


atggacacac 


ctatggtgtc 


ctgtggaaat 


gaacaatttt 


tggataacag 


8820 


ttttgagaaa 


gttaaacgga 


gacttgattt 


agatattgat 


ggtttgcaaa 


aagaaaactg 


8880 


cccttatgtc 


ataacaagtg 


gaataactga 


acaagaaagg 


caacatttgc 


cagaaaaaag 


8940 


ataccctaag 


ggatctggct 


tcgttaacaa 


gaataaaatg 


ttaggaacta 


gttccaaaga 


9000 


aagcgaggag 


ttactaaaaa 


gcaagatgtt 


agcttttgaa 


gaaatgcgga 


agagactaga 


9060 


agaacagcac 


gcccagcaat 


tatcactact 


catagctgag 


caggaaaggg 


aacaagaaag 


9120 


actgcaaaag 


gtttttagtc 


tggaaataca 


agcaaaattt 


aacaaaataa 


ctgcagtggc 


9180 


aaaaggattt 


cttactcgta 


gacttatgca 


gacagataag 


ctgaagcaac 


ttcgacaaac 


9240 


tgtaaaattg 


cgagctgcct 


tgtacggtat 


tcatgacata 


ttctttgtaa 


tggatgcagc 


9300 


tgaaagaatg 


tctattctac 


atcatgatcg 


agaagttcgc 


aaagagaaaa 


tgctcaggca 


9360 


aatggataaa 


atgaaaagtc 


cacgagtggc 


tctttcagct 


gcaacacaga 


agtctcttga 


9420 


taggaagaaa 


tacatgaatg 


ttgggaatag 


caggtacgag 


tggatatact 


ttcctactca 


9480 


tactgagctg 


aacactggag 


ggcccgttct 


ggagatgatc 


cttttccctg 


tcataggtca 


9540 


ccatctcatg 


aacaaccaca 


gaccaccacc 


cccgaatgaa 


aacatggcgt 


ctgcagaagg 


9600 


tgctgtgcgg 


ccgctgcggg 


gcggccagct 


gcccttcaca 


aacatggcgg 


ccgaggggtg 


9660 


cggggagtgg 


cggggtaagg 


atgggaagcc 


gagcagacgg 


ccccagaaca 


agcggtcatg 


9720 


tgactgggaa 


gatggccgtc 


tttccttgtt 


tattattaag 


gatattacaa 


agaatacaga 


9780 


tgaacagcca 


gatgaagagc 


tacgtagggt 


cacagagtca 


aagacaaaga 


aagtgaaccg 


9840 


gaaaggaagc 


acttcttcca 


cgtcctcctc 


ctcctccagc 


tccgtggtgg 


acccgctgag 


9900 


cagcgtcctc 


gatgggactg 


accccctctc 


catgtttgca 


gccactgctg 


accccgcagc 


9960 


cttggcagct 


gccatggaca 


gctccagaag 


gaaacgtgat 


agagatgata 


actccgttgt 


10020 


aggatcggat 


tttgagcctt 


ggaccaacaa 


acggggagaa 


atccttgccc 


ggtacaccac 


10080 


taccgaaaag 


ctgtctatta 


atctgtttat 


gggatctgaa 


aaaggcaaag 


ctgggactgc 


10140 


cacattggca 


atgtcagaga 


aggtgcggac 


ccggctggag 


gagctggatg 


actttgagga 


10200 


gggttcccaa 


aaggagctgt 


tgaacttgac 


tcagcaggat 


tacgtgaacc 


gcatagagga 


10260 


gctcaaccaa 


tcgctgaagg 


atgcctgggc 


ctcagaccag 


aaagtgaagg 


ctctaaaaat 


10320 


agtcatccag 


tgttcaaagc 


ttctttcaga 


caccagtgtt 


attcagttct 


acccaagcaa 


10380 
atttgtcctt


atcaccgaca 


tacttgatac 


atttggaaag 


ctcgtgtacg 


agcgcatctt 


10440 


ttccatgtgt 


gtggatagcc 


gcagcgtctt 


accaggatgt 


ttttgtagtt 


acgtggaggc 


10500 


atccatcctg 


aaatgtaaca 


aattcctctc 


caaaacggga 


atttcagagt 


gcctgccccg 


10560 


gttgacatgc 


atgatcagag 


ggatcggaga 


cccactagtg 


tcggtgtatg 


cccgtgccta 


10620 


cctgtgccgg 


gtgggaatgg 


aagtggcccc 


acatctcaaa 


gaaaccctaa 


ataagaactt 


10680 


ttttgacttc 


ctccttacgt 


tcaaacagat 


tcatggggat 


acggtccaga 


accagctggt 


10740 


ggtccaagga 


gtggagctcc 


catcttacct 


ccccttgtac 


ccgcctgcca 


tggactggat 


10800 


cttccagtgc 


atctcctacc 


atgcccccga 


ggctctgctg 


accgagatga 


tggaaaggtg 


10860 


taagaaacta 


ggaaacaatg 


ccttgctgtt 


gaattctgtg 


atgtctgcct 


tccgggctga 


10920 


gttcatcgcc 


acaaggtcta 


tggatttcat 


tggcatgatt 


aaagagtgtg 


atgaatctgg 


10980 


tttccccaag 


catcttcttt 


ttcgatcact 


gggattaaac 


ttggccttgg 


ctgatcctcc 


11040 


tgagagtgac 


cgacttcaga 


ttctcaacga 


agcttggaaa 


gtcatcacta 


agctgaagaa 


11100 


cccacaggac 


tacattaatt 


gtgccgaagt 


gtgggtggaa 


tacacctgca 


agcatttcac 


11160 


gaaacgagag 


gtgaataccg 


ttttggcaga 


tgtcatcaag 


cacatgactc 


cagatcgtgc 


11220 


atttgaagat 


tcctaccccc 


agcttcagtt 


aataattaag 


aaagttattg 


cccacttcca 


11280 


tgacttctca 


gttcttttct 


cagtggaaaa 


atttctgccg 


tttctggaca 


tgttccaaaa 


11340 


agagagtgtg 


cgggtggagg 


tttgcaaatg 


catcatggac 


gcctttatca 


agcatcaaca 


11400 


agagcccacc 


aaggacccgg 


tcatcttgaa 


tgcccttttg 


catgtttgca 


agaccatgca 


11460 


tgactctgtg 


aatgcactca 


ctcttgagga 


tgagaaaaga 


atgctgtcat 


atttgattaa 


11520 


tggatttata 


aaaatggttt 


cctttggccg 


tgattttgaa 


caacagctga 


gtttttatgt 


11580 


tgagtccagg 


tcgatgtttt 


gcaatctgga 


gcctgttctt 


gtgcagttga 


ttcatgcctg 


11640 


tgttgcctac 


tgcttcatca 


ccatcccctc 


cctggcgggc 


atcttcacac 


gtctcaatct 


11700 


ctacctgcat 


tctggtcagg 


tggccttggc 


caaccagtgc 


ctctcccaag 


ctgatgcttt 


11760 


tttcaaagcc 


gctataagcc 


ttgttccgga 


agttccaaag 


atgattaata 


ttgatgggaa 


11820 


gatgcggcca 


tcggaatcgt 


tccttctgga 


attcctctgc 


aatttctttt 


ctactttatt 


11880 


glcl La^ L. Ulrft— y 




aacai-ggggu 




gcccgagagc 




±xy t ±\J 


gatccaggac 


tacacctggg 


aggacaacag 


cgatgagaaa 


atccgcatct 


acacctgcgt 


12000 


cctgcatctc 


ctctccgcca 


tgagccagga 


gacgtacctt 


taccacatag 


acaaagtgga 


12060 


ctccaacgac 


agcctctacg 


ggggagactc 


caagttcctg 


gcagaaaaca 


acaagctgtg 


12120 


tgagacggtg 


atggctcaga 


tcctagagca 


tctgaaaacc 


ctggccaagg 


acgaggccct 


12180 






gaagcgccag agctcgttgg 


gcctttcctt 


ctttaacagc 


atcttggccc 


atggggacct 


12240 


acgcaacaac 


aagctcaacc 


agctctccgt 


caacctgtgg 


cacctggcac 


agaggcacgg 


12300 


ctgtgcagac 


accaggacca 


tggtgaaaac 


gctagaatac 


atcaagaagc 


aaagcaaaca 


12360 


accagacatg 


actcatctga 


cggagctggc 


cctcagactc 


cctctgcaaa 


caaggacctg 


12420 


a 












12421 


<210> 129 

<211> 1494 

<212> DNA 

<213> Homo sapien 












<400> 129 
cggccgccgg 


gcaggtacta 


ggttctgaca 


tcagtgaagt 


agcttccatt 


tcatttctct 


60 


gagtgcttca 


ctcagcaaga 


gactatgcga 


caactacgtg 


ctgggtgttc 


cagatgtatt 


120 


gaactctgga 


caaactgctt 


gacttcagcc 


tcttgaacta 


ttgttaatac 


acgggcagct 


180 


tgttatatag 


ctcccattgg 


tatacaactt 


gctaattaaa 


tggaaaatac 


ccttgggccc 


240 


acgtgccatt 


gtgcttgtgc 


tatacaacca 


tggtaaattc 


cccaggcatt 


ttattgggcg 


300 


cagtgtccac 


atgggccgtg 


tgttggtatt 


caacccttgg 


aagtgctcag 


gggaagattc 


360 


tgaagaaacc 


cattccttag 


tcgccaaagc 


gattggtttg 


acatgcccgc 


cggctctatt 


420 


ctaacctaac 


agacagaatt 


acggaagaca 


ttttatgtgc 


ttgttgtgtc 


cgctatgtgt 


480 


ttgtggctgc 


ggtgacaaac 


ccagttgcta 


aacacatacc 


ttccaaggcc 


tttacctttt 


540 


gcaacgcaat 


gtggcctttg 


cacacaagta 


ctggagaggg 


aagattaccg 


aactatatga 


600 


aaacccctct 


gtgcggaacg 


gtggcccatg 


gaacttggct 


atttggccac 


atggctagtt 


660 


aaggccacct 


aaaacggcaa 


aattttgggt 


tggtcgccta 


agcgtttggt 


caaatcagtt 


720 


gcccctaatg 


tggccctttg 


ggtgcacggt 


caactttatc 


cgccgaagtt 


tccgattcgt 


780 


aatagaatcg 


tagcatcatg 


acatgaaacg 


acacacacac 


acacaaaaca 


cactacaata 


840 


gcagcgctca 


ctgatgcagc 


gagcggtaaa 


taaccaccct 


ccgacagaga 


ggggcggacc 


900 


gaccacaaaa 


tagagacgca 


gtgtagagta 


ctacacacca 


cccaggttgt 


tagttgggta 


960 


gtgcagacac 


aacattcttc 


ttgtgaatat 


aaagcattcc 


cgcgagtgat 


ccttcctact 


1020 


accagtatta 


tcatccctac 


tcatgctatg 


gctacatcgt 


gcaacaatta 


ttcatgcacg 


1080 


tgcgagtgat 


agcgacacaa 


acatacggtg 


atacatctaa 


ctagacgcta 


acgcacatca 


1140 


gcatccaacg 


agaatatatt 


atgaacagat 


ggagcacgac 


tgaagtcgta 


ggttgattcg 


1200 


cacatgctgg 


acgtatgagt 


atccggagat 


gtcaactgac 


aggagtgcat 


gtgggcacag 


1260 
ggcttacggc


agtagtagca 


ccatacggct 


gacctagaga 


tggattggta 


aatcaaagta 


1320 


gtgagggagc ggcggaatgg 


gaagcgtgac 


ggggatgtga 


tgttagctgt 


cggagttgga 


1380 


cggagggagg 


catgggagga 


acgtgtgagc 


aatagagccg 


gcgaggtgtc 


tacaggaagg 


1440 


gagagcagag 


gcgagcacgg 


agcgatcgct 


ggagctgcag 


ctgcccaggc 


ggcg 


1494


<210> 130 
<211> 1774 
<212> DNA 
<213> Homo sapien 












<400> 130 
tgtgagttcc 


acagaagatt 


tcagttgaaa 


agcactatta 


atgtttccta 


ctgtggacag 


60 


catttagagg 


aaacatttta 


agcagcacgt 


tcatatttcc 


tctgttcacg 


tacttagtca 


120 


acagctcttc 


attgtgcgct 


tatcgtgtgc 


aaggcagtgt 


tctacatgct 


atggggtaaa 


180 


caagatgaca 


aatccctgtc 


cccttagaaa 


agagcttgca 


accacagtgg 


ctgttggaaa 


240 


gaagcaacag 


tggtgctaaa 


agacaccagg 


taaactggaa 


aggaaaattc 


atcatatagc 


300 


gtactaggtt 


ctgacatcag 


tgaagtagct 


tccatttcat 


ttcttgaggc 


ttcatcagaa 


360 


gagatatgcg 


acaatacgtg 


ctgggtgttc 


cagatgtatt 


gaactctgga 


caaactgctt 


420 


gacttcagcc 


tcttgaacta 


ttgttaatac 


acgggcagct 


tgttatatag 


ctcccattgg 


480 


tatacaactt 


gctaattaaa 


tggaaaatac 


ccttgggccc 


acgtgccatt 


gtgcttgtgc 


540 


tatacaacca 


tggtaaattc 


cccaggcatt 


ttattgggcg 


cagtgtccac 


atgggccgtg 


600 


tgttggtatt 


caacccttgg 


aagtgctcag 


gggaagattc 


tgaagaaacc 


cattccttag 


660 


tcgccaaagc 


gattggtttg 


acatgcccgc 


cggctctatt 


ctaacctaac 


agacagaatt 


720 


acggaagaca 


ttttatgtgc 


ttgttgtgtc 


cgctatgtgt 


ttgtggctgc 


ggtgacaaac 


780 


ccagttgcta 


aacacatacc 


ttccaaggcc 


tttacctttt 


gcaacgcaat 


gtggcctttg 


840 


cacacaagta 


ctggagaggg 


aagattaccg 


aactatatga 


aaacccctct 


gtgcggaacg 


900 


gtggcccatg 


gaacttggct 


atttggccac 


atggctagtt 


aaggccacct 


aaaacggcaa 


960 


aattttgggt 


tggtcgccta 


agcgtttggt 


caaatcagtt 


gcccctaatg 


tggccctttg 


1020 


ggtgcacggt 


caactttatc 


cgccgaagtt 


tccgattcgt 


aatagaatcg 


tagcatcatg 


1080 


acatgaaacg 


acacacacac 


acacaaaaca 


cactacaata 


gcagcgctca 


ctgatgcagc 


1140 


gagcggtaaa 


taaccaccct 


ccgacagaga 


ggggcggacc 


gaccacaaaa 


tagagacgca 


1200 


gtgtagagta 


ctacacacca 


cccaggttgt 


tagttgggta 


gtgcagacac 


aacattcttc 


1260 


ttgtgaatat 


aaagcattcc 


cgcgagtgat 


ccttcctact 


accagtatta 


tcatccctac 


1320 
tcatgctatg gctacatcgt


gcaacaatta 


ttcatgcacg tgcgagtgat agcgacacaa 


1380 


acatacggtg atacatctaa 


ctagacgcta 


acgcacatca gcatccaacg agaatatatt 


1440 


atgaacagat 


ggagcacgac 


tgaagtcgta 


ggttgattcg cacatgctgg acgtatgagt 


1500 


atccggagat 


gtcaactgac 


aggagtgcat 


gtgggcacag ggcttacggc agtagtagca


1560


ccatacggct 


gacctagaga 


tggattggta 


aatcaaagta gtgagggagc ggcggaatgg 


1620 


gaagcgtgac 


ggggatgtga 


tgttagctgt 


cggagttgga cggagggagg catgggagga 


1680 


acgtgtgagc 


aatagagccg 


gcgaggtgtc 


tacaggaagg gagagcagag gcgagcacgg 


1740 


agcgatcgct 


ggagctgcag 


ctgcccaggc 


ggcg 


1774 


<210> 131 
<211> 531 
<212> DNA 
<213> Homo sapien 








<400> 131 
ccgaggtact 


taattctttg 


gcaacaaaca 


gcaagttttt atggtttaat tgtatttcct 


60 


ctctagagat 


cacaatactc 


tgggtatttt 


atatcccttc taaaggtcat ttgggttttc 


120 


aaatgggaag 


aatagtcaag 


ctaagctgga 


ctaaacctaa gtaatatttt atctcatcaa 


180 


aagaagttat 


taatctaact 


gggttagcat 


gagtcattca ttttttagac atgataaatg 


240 


ggaaacatat 


caaatcatta 


gataaattct 


gacctggaat cgaattcccc ctttcttaaa


300


atctttttaa 


tttgcttttt 


tcatagactc 


agtatggaac tcttaactga taagggagag 


360 


attcttgatc 


tggaaccgtt 


ccctgccatt 


ctcctttttt ctctctgtct aggtagctgg 


420 


ttccatagtg 


cccgtcacga 


gggtccattt 


cagtttgatg acattagact gcttacactc 


480 


agctggatgc 


cttgctgttt 


gcagcaacat 


gattttacag tatgcttttc t 


531 


<210> 132 

<211> 4309 

<212> DNA 

<213> Homo sapien 








<400> 132 
tggggcatga 


agttctgcca 


ctaaggcata 


accagtaccc aagtaggaaa ggattattga 


60 


tccctggagt 


aaagattcct 


agcttgcggg 


gatctcacta ttgatccccc ggagtaaaga 


120


ttcctagctc 


gcaggaatct 


tacttgcaca 


cttaaggaaa ataccagagg gagccactag 


180 


caaccgaaaa 


atgaaggaaa 


gatttaattt 


ctctacccag gtacaaatcc aatgcatagt 


240 


atagtatatg ttatatgcag 


aaagggcact 


ggaggtgtga gagccctttg ggtaacagct 


300 
ctgctagttg


ttaccttcac 


acttctcttt 


cttcttaatc 


tctggtttct 


cgtctgaaaa 


360 


atgagataat 


tggactagat 


taatagttct 


cagccttttc 


tagaacaaca 


aattagatta 


420 


tctctgaagt 


ccttttccaa 


aatttcatgc 


ttccagtctc 


tccccgtgat 


tgcatattat 


480 


ccctaggcta 


attaaaagtg 


tgctggttaa 


tcagtttctt 


tacttttttt 


tctctctttt 


540 


ttttattctg 


tgttcaggta 


tcatttgacc 


aacaacttgc 


tcttaatctt 


actgcgatga 


600 


ctgaaatctc 


caaagaggag 


agtatagtag 


ttataggtct 


agtaataata 


atcttgtaaa 


660 


caattcatta 


ttttttaaaa 


cttaatgctg 


tgtgcctaat 


cagtgttgag 


tatgactaaa 


720 


atgtaaatga 


gaaccaaata 


tgacactaac 


gctggcacta 


aattttattt 


ttacttgtgt 


780 


acttaaagta 


ctttactggc 


ctgttgggaa 


attgatttgt 


ataagattca 


cacctggttt 


840 


tggcagaaac 


agctttttaa 


agatacaatt 


aggctgactt 


aaaaatgttt 


tttattccac 


900 


agaatgtatc 


tttattatgt 


cattcttacc 


gtctctctgc 


cttttttgac 


catcatgtaa 


960 


ctacgcagta 


gtaaatcatg 


gggtgcaggc 


tgatgaatta 


gtagtctttg 


taaaactaaa 


1020 


atgtaatgag 


cataggaaga 


gacagctctt 


ctggtgaaca 


tggcagtttt 


ccggtaatac 


1080 


cagccttgat 


gaataggcta 


aagacagctg 


tgtactatgg 


acattcaact 


tcactggatt 


1140 


gcctggctta 


gcttttcagt 


tattgtcttg 


ccgtgaaaca 


tggtgttgca 


ctgtacacat 


1200 


tttagaatag 


tggcaagaat 


gttctttaga 


actgcagata 


acccaacaga 


gaataccata 


1260 


ccaatatatc 


ggtttctcct 


gttacataga 


gatttggtat 


ttcagtgatg 


gttatggttt 


1320 


ttgactcacc 


tgtgtgtcag 


tcctgctaga 


gagacagtat 


agaatcaaga 


atttcttgat 


1380 


gtctttttaa 


aaatagagat 


tataatggcc 


ttgctagttg 


tcctgtgaag 


tgacagaccc 


1440 


ttaattagaa 


atcttttaat 


cctgctttcc 


attttacttc 


tccaccattt 


atttttaaca 


1500 


ttcattctct 


gaaaatagtt 


atgattttta 


gtgtaattgg 


ctatgtttaa 


gaaattccat 


1560 


ttcaaagtct 


taatatgtaa 


caatcttcat 


tttttataaa 


attgaaggac 


atttttgagg 


1620 


caaacttact 


tttatagctc 


attttctccc 


tagttaagga 


gaattctctg 


gtttcatata 


1680 


gagaatggct 


agctctgtga 


taccccaccc 


tctttgtggc 


ctaccagttg 


ctgttgtgtt 


1740 


gcttaaaatt 


gcagatattc 


tcataaacat 


gatgattttt 


ggcacaggcc 


tttctttgct 


1800 


attgatttca


aaacaaagtt


gggcaatccg 


atttgaacta 


ctttaataaa 


cataatttag 


1860


cattcctgta 


atgagaaagt 


attttcaaaa 


gataaccaga 


tttatttcta 


ggattagaga 


1920 


tggcaaatac 


caatgatagt 


atttttccca 


agagcaatca 


cagattaatc 


tataaatatg 


1980 


agaagtcgca 


ttattgtact 


taattctttg 


gcaacaaaca 


gcaagttttt 


attgtttaat 


2040 


tgtatttcct 


ctctagagat 


cacaatactc 


tgggtatttt 


atatcccttc 


taaaggtcat 


2100 
ttgggttttc


aaatgggaag 


aatagtcaag 


ctaagctgga 


ctaaacctaa 


gtaatatttt 


2160 


ttctcatcaa 


aagaagttat 


taatctaact 


gggttagcat 


gagtcattca 


ttttttagac 


2220 


atgataaatg 


ggaaacatat 


caaatcatta 


gataaattct 


gacctggaat 


tgaattcccc 


2280 


ctttcttaaa 


atctttttaa 


tttgcttttt 


tcatagactc 


agtatggaac 


tcttaactga 


2340 


taaggagaga 


ttcttgatct 


ggaaccgttc 


cctgccattc 


tcctgttttt 


ctctctgtct 


2400 


aggtagctgg 


ttccatagtg 


cccgtcacga 


gggtccattt 


cagtttgatg 


acattagact 


2460 


gcttacactc 


agctggatgc 


cttgctgttt 


gcagcaacat 


gattttacag 


tatgcttttc 


2520 


tcaaagcttt 


gcattcttaa 


tggagatatc 


aaatggtgta 


attccaaata 


taaatatgtt 


2580 


tatgacacta 


atcatatgct 


tttaacaata 


actttttgat 


aacttattgc 


cctgtaagtt 


2640 


aaaccttaca 


cggagtgact 


gttgccatca 


gaaaatcccc 


tttcacctgc 


tagagagagt 


2700 


gaaagtagtg 


cgtggcctcc 


acttttcagt 


taacgagtag 


caagcttcta 


gtaggggctg 


2760 


cttatctgac 


ccatgtgggg 


tcttgggcct 


tgttactttc 


ctgggttcgt 


cccttcagct 


2820 


ggaaaggctg 


ttgaaaacac 


ttgccaggaa 


acagtaaagc 


tgtgagagaa 


tcttctagtc 


2880 


ttagatatag 


agaaatgagt 


accagttgat 


gctaacacga 


tacttagact 


ttgaggggct 


2940 


tccatccaag 


ctttggcatg 


aatctgttag 


acacggtttt 


cttcactcct 


ctgataatct 


3000 


ttttttccag 


aatggttttt 


tgttgttgtt 


gttgttaaca 


aattctaaac 


atccagtgtt 


3060 


actttttgtt 


ttgtttgttt 


tgttttgttt 


tgttttgttt 


tgtttgccct 


tcaggaaagt 


3120 


cctttttcga 


gtaggtattg 


attgttccct 


ggaaactctc 


agcagtgtct 


gtgcaggctc 


3180 


tgtgcatgct 


ttgtatgagt 


tcggcttgaa 


caatgcattt 


gaagtcacct 


gggatgtcca 


3240 


gttctggcat 


gtcttcattg 


attgtgtttt 


taaacatgtt 


tcatgtttca 


tgtcattttc 


3300 


caaacctcac 


tttacatctt 


actcagagaa 


gttaatcaag 


gaatgaaatt 


tctaggggga 


3360 


tagggatgat 


gagggtgggt 


tgggggcttg 


agtgaagtca 


acttggggta 


tttgctttaa 


3420 


agtgttttct 


aaagcagttc 


ctaacagtat 


ttagagattc 


cctgggatgt 


ttgtggctca 


3480 


acttattctg 


gacaaagtgt 


gtgtgggggt 


ggtcttactg 


agatttgcat 


tcttaatgca 


3540 


ggacaggctc 


taaatttcat 


ctgtactcta 


aatttgtgat 


tgaatccaag 


aagataacag 


3600 


agacagtgct 


cctgttgtaa 


tgtttctggc 


aagtgctccc 


taaaatgcac 


atcgaattct 


3660 


gttttctggg 


ccttttctcc 


aatggtgcta 


ggagataccg 


ttgatttctg 


cagctcttct 


3720 


cagtggtggg 


aagaagtctt 


tgggattgtt 


gagcaagggg 


cagctggacc 


atccactaaa 


3780 


tttttttgtt 


caagacacat 


tagagaccct 


cctgtatatc 


tagtaagtca 


taataaaggt 


3840 
gcttgggaaa gccttaaatt


tgaagacaca 


tggaggcggt 


agaaaattaa 


acttgtaaga 


3900 


ggagaaaaac atgccattag 


gtaacgcaga 


gttgtaacta 


ctggctaaga 


ctcaatggaa 


3960 


cttccacttg 


ctctaaaacc 


agggaaggga 


gtgggacgat 


aagtcttttg 


aagacatcag 


4020 


ctcactgtgc 


tgagagaggg 


accaaactca 


aggaaacctc 


tgatctatac 


attcaactgc 


4080 


tgcatttttt 


tataaataca 


tgtaaatgtc 


ctgttgtaat 


atttgatctt 


ggaaataaaa 


4140 


acaaaaactt 


ttcgtagaaa 


aaacacacac 


gacaaaaaca 


aaatttgggg ggccggcgcc 


4200 


aagaagtttc 


aaccacatgt 


gtggggccgg 


cccaggtgaa 


ggaaatggca 


agtggccaag 


4260 


aaccggccag aaggggggtt 


tcccaaaaaa 


cgcgcgggcg 


cgcgtttga 




4309 


<210> 133 
<211> 730 
<212> DNA 
<213> Homo sapien 












<400> 133 
gccgccgggc 


aggtacaacg 


tggagatttc 


cctctgggaa 


aatcacccag 


caagtgaaat 


60 


ccgacccaga 


gggggacaca 


ggagtgctcc 


cgtgggacag 


cgtggccacc 


actgggaggg 


120 


gctctggcag 


tttttcttgt 


tttttacgct 


gcgatgtctg 


tgaatcttga 


gagcccagct 


180 


gatgcgttgt 


aacctcacct 


gggggcgatg 


tggaacattc 


caggcctggc 


aggggctatg 


240 


cctgccatgc 


agacctcacc 


cgagccatct 


caccctgggt 


ctgtcagagt 


cccacgtgct 


300 


gtcgcccctc 


acccaccccc 


gacggggcca 


tgctcctggt 


catgctgtga 


ttcattcatc 


360 


attccatggg 


caggtgtggg 


attgtctctg 


tgtttctgtc 


tgctttttaa 


agaagatgag 


420 


gtaagcatgg 


aaaacaaaac 


aaacgtggta 


accccatcac 


tcagacgtgt 


ccactgttag 


480 


cattcgggaa 


caggtgagag 






aaggctggct ggcccagact 


540 


ttggtcttgt 


gctgtgtgat 


cttggggaaa 


tcactgattt 


ctgagccttt 


ttcccagctg 


600 


ctttttctct 


tggggctggt 


agaaggaata 


agagagtgag 


tgtcaaaaaa 


aaaaaaaaaa 


660 


aaagcttggg 


gttaatcatt 


ggcaaatatg 


ttcccggtta 


aaattgtttc 


ggtcaaatcc 


720 


atttggagaa 












730 



<210> 134 

<211> 226 

<212> DNA 

<213> Homo sapien 



<400> 134 

gcccgggcag gtactgggtt gcagcaagct atgatgccct gaggttccct gaagacaagt 60
aagaacacac tcaacttgtc actcagaaac gcagccctta aaggcttcca gataagaata 120











atgactgggt 


aaagccaggc 


tgccacctaa 


gctgaagcta 


caggtaggtg 


actaaggaaa 


180 


cactgatgtc 


aggccagcct 


acraecaacfc t 


w bw^auy w 


tcttta 




226 


<210> 135 
<211> 937 
<212> DNA 
<213> Homo sapien 












<400> 135 

tttttttttg 


agatggagtc 


tccrctfaaat 



a " c* a crop a 


aaaacaataa 


cfccaatcfcca 


60 


gctcactgca 


acctccgcct


crtaoattca 


UGLwg d W Wl* 


rtacctcaac 



cttctaaota 


120 


gctgggatta


caggcatgcc 


ccaccacocc 


raacfaattt 


fctatattttt 


crcrfcaGacrafccr 


180 


gggtttcacc


atattgtcca 


ggctggtctc 


aaacccctga 


cctcaggtga 


tctggccacc


240 



tCayCCtttC 



aaag uyct-y a 


gattataggc 


gtgagccact


gtgcccagcc 


gcagaaaagc 


300 


ttcttacctt 


gagcaggggc 


ah c t c t ccrcr cr 



cctgctggat


taaaaggtgc 


atttcgttga


360 


tctgctgccg 


cacaaggggc 


yytattyyy t. 


L.ytayL>aayL 


i" at*oat*crp.cc 


t* era crcr 1 1 c c c 


420 


tgaagacaag 


taagaacaca 


t> w aat w i«y u. 


i«a.w vvciy &oci 


rcrraaccctt 



aaaaacttcc 



480 


agataagaat 


aatgactggg 


taaaaccaaa 


ctaccaccha 



aercfcaaaacfc 


acaactaaat 


540 


gactaaggaa 


acactgatgt 


w v* *j *j w ^ 


taggccaact 


tctccatgtg 


t tctttQCcra 


600 


caccaagctt 


taagtggatc 


cctcttggag


cctgactccc


acaatcaggg 


aaaactgaaa


660 


gtaagtgcca 


cagcactgtg 


aattctcaat 


fccccreaacTCfa 



gcaaccaacg 


cractQtcJcrat 



720 


tttcattctt 


gcttatactt 


caatcaagat 


cacaaaacgt 


ttaagatcat 


gagctgcttt


780 


aatttggaag 


aaagtacaga 


tgacagaatc 


tgggacattt 


tcttcccttg 


caagtagatg 


840 


ctataaggct 


cagagactta 


aaataaactc 


aaactaaggc 


ataatcattg 


aaatgattcc 


900 


agttcagaag 


ttttagtgct 


taaatggcag 


cctoaaq 






937 


<210> 136 

<211> 96 

<212> DNA 

<213> Homo sapien 












<400> 136 
gcccgggcag 


gtaccaatca 


attttgaaaa 


tgaaagggaa 


aatacttatc 


tttcctatat 


60 


gaattgggaa 


aataaaagga 


tatgaaaagg 


taaata 






96 



<210> 137 
<211> 151 
<212> DNA 
<213> Homo sapien
<400> 137 

aaaatgtagg cagcaaaagt ggaagaggag aggcagctgg tgcactaatc caggtgagag 60 
gtaaagatgg atgggtaggg ctgaaggatg gtggggcagg tggtgagaag tgacttggtt 120 
ctggagacat atgaaggaag atggtcaggc a 151 

<210> 138 

<211> 604 

<212> DNA 

<213> Homo sapien 

<400> 138 

cgtggtcgcg gcgaggtact ttcttagacc agtgtaacct cacacctcag tttggctttt 60 

ccaaccctga cttgaaaggc atatttgtat ctttttatta gtgatagtga agctgtgaca 120 

ctaacctttt atacaaaaga gtaaagaaag aaaaactaca gcgattaaga tgagaacagt 180 

tctgcagttg ttgaactaga tcacagcatt gtaggcagaa taaaaaatgt tcatatctga 240 

gaatattcct ttcgccatct tttcccaagg ccagacctcc tggtggagca cagttaaaag 300 

taacattctg ggccttgtaa tcggagggct gtgtctccag ctggcagcct tgttttaata 360 

tataatgcag gactgtggaa aacagttggc atagaatatt ttcacctaaa aaagaaagaa 420 

aagacataca aaactggatt aattgcaaaa agagaataca gtaaaatacc atataactgg 480 

acaaagctag aagaaccttt agaagatttg tctgaaaaca gatttcaaga gtgagctttt 540 
atacactgct cactaatttg cttgattact accaactctt cttaaagtta acacgtttaa 600
ataa 604



<210> 139 

<211> 4461 

<212> DNA 

<213> Homo sapien 

<400> 139 

tgagttttcc cctgttggcc agggatggtc tcgatctcct gacctcgtga tctgtccacc 60 

tcggcttccc caagtgctgg gattacaggc atgagccacc gcgcccagcc tagccatatt 120 

tttatctgca tatatcagaa tgtttctctc ctttgaactt attaaccaaa aaggaacatg 180 

cttttcatac ctagagtcct aatttcttca tcatgaaggt tgctattcaa attgatccat 240 

cattttaatt ttaccaatgg ctcaaaaatt ctgttcagta aatgtctttg tgactggcca 300 

atggcataaa ttatgtttaa gattatgaac ttttctgaca gttgcagccc atgttttccc 360 

tacgatacca gatttccatc ttggggcata ttggattgtt gtatttaaga ccgtcagaat 420 
aatgatagtg tgtggtctcc agaggtagtc agaatcctgc tattgagttc


tttttatatc 


480 


ttccttttcc 


attttttatt 


accattttgt 


ttgtttagac 


tacactttgt 


agggattgag 


540 


gggcaaatta 


tctcttggag 


tggaattcct 


gtgttttgag 


ccttacaacc 


aggaaatatg 


600 


agctatacta gatagcctca 


tgatagcatt 


tacgataaga 


acttatctcg 


tgtgttcatg 


660 


taattttttg 


agtaggaact 


gttttatctt 


gaatattgta 


gctaactata 


tatagcagaa 


720 


ctgcctcagt 


ctttttaaga 


aggaaataaa 


taatatatgt 


gtatgaattt 


atatatacat 


780 


atacactcat 


agacaaactt 


aacagttggg gtcattctaa 


cagttaaaac 


aattgttcca 


840 


ttgtttaaat 


ctcagatcct 


ggtaaaatgt 


tcttaatttg 


tctgtgtaca 


ttttcctttc 


900 


atggacagac 


cattggagta 


cattaatttt 


cttaatctgc 


catttggcag 


ttcatttaat 


960 


ataccatttt 


ttggcaactt 


ggtaactaag 


aatcacagcc 


aaaatttgtt 


aacatcaaag 


1020 


aaagctctgc 


catatacccc 


gttactaaat 


tattatacat 


ccagcagatt 


ctgggatgta 


1080 


ctaacttagg 


gttaactttg 


ttgttgttga 


taatactaga 


ttgctccctc 


tttaattctt 


1140 


cttctggtgc 


aaggttgctg 


cttaagttac 


cctgggaaat 


actactacaa 


ggtcaaattt 


1200 


tctagtatct 


tacagcctga 


ttgaaggtga 


ttcagatctt 


tgctcaatat 


aaatggattt 


1260 


tccaagattc 


tctgggccat 


ccttgaccca 


caggtgatct 


cgctggagta 


tattaactta 


1320 


acttcagtgc 


cagttggttt 


ggtgccatga 


gatccataat 


gaatccagaa 


cttcaccatt 


1380 


gcttagatat 


aagagtccct 


tggaagaata 


atgccactga 


tgatgggggt 


cagaaggtgt 


1440 


attaactcaa 


catagagggc 


ttttagattt 


ttcttcaaaa 


aaatttcgag 


aaaagtattc 


1500 


ttttaccctc 


caaacagtta 


acagctctta gtttctccaa 


atatgctctt 


tgatttactt 


1560 


atttttagtt


aaagatggta 


atttattgaa 


caatgaaatc 


cgtaatatat 


tgatttaagg 


1620 


acaaaagtga 


agttttagaa 


ttataaaagt 


acttaaatat 


tatatatttt 


ccatttcata 


1680 


attgttttcc 


tttctctgtg gctttaaagt 


ttttgactat 


tttacaatgt 


taatcactag 


1740 


gtaacttgcc 


atatttctgg 


ttctatatta 


agttctatcc 


tttataatgc 


tgttattata 


1800 


aagctggttt 


ttagcatttg 


tctgtagcaa 


tagaaatttt 


actaagtctc 


tgttctccca 


1860 


gtaagttttt 


tcttttctca gtaagtccct 


aagaaaacat 


ttgtttgcca 


ctcttactat 


1920 


tcccaatctt ggattgttcg agctgaaaaa aaatttgatg agaaacagga ggatcctttt 


1980 


ctggtgaata 


taggttcctg 


ctttaagaat 


gtggaaatcc 


attgctttat 


ataactaata 


2040 


tacacacaga 


ttaattaaaa 


ttgtgagaaa 


taattcacac 


atgacaagta 


ggtaacatgc 


2100 


atgagttttg 


aattttttta 


aaaacccaac 


tgtttgacaa 


aatatagaac 


ccaaattggt 


2160 


actttcttag 


accagtgtaa 


cctcacacct 


cagttttgct 


tttccaaccc 


tgacttgaaa 


2220 
ggcatatttg


tatcttttta 


ttagtgatag 


tgaagctgtg 


acactaacct 


tttatacaaa 


2280 


agagtaaaga 


aagaaaaact 


acagcgatta 


agatgagaac 


agttctgcag 


ttgttgaact 


2340 


agatcacagc 


attgtaggca 


gaataaaaaa 


tgttcatatc 


tgagaatatt 


cctttcgcca 


2400 


tcttttccca 


aggccagacc 


tcctggtgga 


gcacagttaa 


aagtaacatt 


ctgggccttt 


2460 


gtaatcggag 


ggctgtgtct 


ccagctggca 


gcctttgttt 


taatatataa 


tgcaggactg 


2520 


tggaaaacag 


ttggcataga 


atattttcac 


ctaaaaaaga 


aaaaaaagac 


atacaaaact 


2580 


ggattaattg 


caaaaagaga 


atacagtaaa 


ataccatata 


actggacaaa 


gctagaagaa 


2640 


cctttagaag 


atttgtctga 


aaacagattt 


caagagtgag 


cttttataca 


ctgctcacta 


2700 


atttgcttga 


ttactaccaa 


ctcttcttaa 


agttaacacg 


tttaaataag 


gtatttctgg 


2760 


acttcctagc 


cttttagcaa 


gcttagagga 


actagccatt 


agctagtgat 


gtaaaaatat 


2820 


tttggggact 


gatgccctta 


aaggttatgc 


ccttgaaagt 


tcttaccttt 


tctctagtga 


2880 


tattaaggaa 


cgagtgggta 


gtgttctcag 


ggtgaccagc 


tgccctaaag 


tgcctgggat 


2940 


tgagggtttc 


cctggatgcg 


ggactttccc 


tggatacaaa 


acttttagca 


gagttttgta 


3000 


tatatgtgga 


tttttctgat 


aagtagcaca 


tcagaggcct 


taaccactgc 


ccaaaagcga 


3060 


ttctccattg 


agagtacata 


tcttgaactt 


aagaaattca 


tttgctctga 


tttttaatct 


3120 


tgtaaagttt 


ttgctaaact 


caaaacaagt 


cccaggcaca 


ccagaaggag 


ctgaccacct 


3180 


taggtgttct 


tgtgatttat 


ccttacttcc 


ctatgttgtc 


atagttgctt 


ctaaactcag 


3240 


ctgcactatg 


gctgtcaaca 


tttctgatac 


ttattgggat 


atgtgccatc 


cagtcattta 


3300 


gtactttgaa 


tggaacatga 


gatttataac 


acaggtaata 


gctgaaggta 


ccagtatggt 


3360 


ggtgagactc 


acacttagtg 


atccagctaa 


ggtaactgat 


gttataatgg 


aacagagaag 


3420 


aggccaacta 


gatagctaag 


ttcttctgaa 


cctatgtgta 


tatgtaagta 


caaatcatgc 


3480 


gtccttatgg 


ggttaaactt 


aatctgaaat 


ttacattttt 


catagtaaaa 


ggaaaccaat 


3540 


tgttgcagat 


ttcttttctt 


gtgaggaaat 


acatggcctt 


tgatgctctg 


gcgtctactg 


3600 


catttcccag 


tctgttctgc 


tcgagaagcc 


agaatgtgtt 


gttaacattt 


ttccgtgaat 


3660 


gttgtgttaa 


aatgattaaa 


tgcatcagcc 


aatggcaagt 


gaaggaattg 


ggtgtcctga 


3720 


tgcagactga 


gcagtttctc 


tcaattgtag 


cctcatactc 


ataaggtgct 


taccagctag 


3780 


aacattgagc 


acgtgaggtg 


agattttttt 


tctctgatgg 


cattaacttt 


gtaatgcaat 


3840 


atgatggatg 


cagaccctgt 


tcttgtttcc 


ctctggaagt 


ccttagtggc 


tgcatccttg 


3900 


gtgcactgtg 


atggagatat 


taaatgtgtt 


ctttgtgagc 


tttcgttcta 


tgattgtcaa 


3960 
aagtacgatg


tggttccttt 


tttattttta 


ttaaacaatg 


agctgaggct 


ttattacagc 


4020 


tggttttcaa 


gttaaaattg 


ttgaatactg 


atgtctttct 


cccacctaca 


ccaaatattt 


4080 


tagtctattt 


aaagtacaaa 


aaaagttctg 


cttaagaaaa 


cattgcttac 


atgtcctgtg 


4140 


atttctggtc 


aatttttata 


tatatttgtg 


tgcatcatct 


gtatgtgctt 


tcacttttta 


4200 


ccttgtttgc 


tcttacctgt 


gttaacagcc 


ctgtcaccgt 


tgaaaggtgg 


acagttttcc 


4260 


tagcattaaa 


agaaagccat 


ttgagttgtt 


taccatgtta 


ctatgggact 


aatttttaat 


4320 


tgttttaatt 


tttatttaaa 


ctgatctttt 


tttatatggg 


attacatttt 


ggtgttcact 


4380 


ccctaaatta 


tatggaaacc 


aaaaaaagtg 


attgtatttc 


acatatggaa 


aaaaaaaaaa 


4440 


aaaaaaaaaa 


aatatgcggc 


c 








4461 


<210> 140 
<211> 321 
<212> DNA 
<213> Homo sapien 












<400> 140 
cgaccggcgc 


tttgtgatgg 


atccgcccgg 


gcaggtactc 


tggtagccca 


gtttgcaaga 


60 


gccaggttcc 


agcagggatg 


gtagccagcc 


tgggcacact 


tttctgtatt 


gtatttgaac 


120 


ctcatttcct 


gatgctcatc 


cagaagttcc 


tggctgggtg 


agtaggtgaa 


cagttcaagg 


180 


ctgtgtggat 


gaaactctgg 


catctcttta 


cagtgctgtc 


cggcagggca 


gtgtgaggcc 


240 


actctgtcct 


cctttgacct 


ctttcttctt 


cagcagccaa 


atggaggctc 


tctaaaggtg 


300 


atgattaact 


tggcataaac 


a 








321 


<210> 141 

<211> 1438 

<212> DNA 

<213> Homo sapien 












<400> 141 

gggggacaca gcactgcctg 


aagcagcatc 


ttgtaggtca 


gccagttccc 


gttgacacgc 


60 


agaaagacag 


tactgctggg 


gttcattttc 


tgggaaagcc 


aggcagagcc 


tgaggaagct 


120 


tcatgggaag gcagcatgga 


acttgagctg 


agccacggag 


gatggggaag 


tgggagaagg 


180 


tcatcgaggt 


agctggaaag 


gagagtgttc 


ccagggagca 


ttttagaggt 


gggaacattc 


240 


aggtgcatag 


aggtgagagt 


ggagatgctc 


cagggagttt 


gacagcaagc 


acttggcagt 


300 


gatacagggg 


caggggtaca 


gtctgaaagt 


gctggactgg 


ggtactctgg 


tagccaagtt 


360 


ttgcaagagc 


cagggtttcc 


agcagggatg 


gtagccagcc 


tgggcacact 


tttctgtatt 


420 


gtatttgaac 


cccatttcct 


gatgctcatc 


cagaagttcc 


tggctgggtg 


agtaggtgaa 


480 
ft r?» or 4— 4- « a a ft ft

Ca^LLCaa99 


u uy uy tyyou 


y aaau uu i»yy 




uay uy u uy uw 


caacaaaaca 

uyy wayyy wo. 


540 


/■>r I* 01 4* ^"ra o»oro«ot 

gugugaggcc 


at uu uy uuu u 






uay uay u uaa 


ataaaaactc 


600 


4- o» "H a a a ft ft 4" or 

uct.aaa.ggug 


a t~ era t* h a a c t 


j3 uaaau 


a uy uuaayu u 


f* fr r» n t* cr a Cf c cr 
uuuuuyayuy 


tttcfcccfcaa 


660 


ax ugggcugu 


cue uuy u uy u 


uu uy ay auay 


or ot a or a 4~ ororot 

ggagacuggc 


4- 01 4* r* f* or n r* t* cr 
ugucuggcuy 


auu ay u u u u u 


720 


oror or ♦•«n ft ft 

ggggcugagg 


4» o»4*oi , otr , 4»or4- f 

ucuy y u uy uu 


aafit"t*nr , ai fro* 
day u uy cay g 
Cuay ua.au uu


r* f" era t~ t* a 4- o« a 
u uyy u uy uua 


780
y ayauuayay


Layuay uy u u 


or^ 4* a o*4* a or a 

gc ucac uaga 


ar"4-4-+-of4-oroia 

auu u uy uy ua 


ort*na +* aof*"ort* 

y uy auay uy u 




tctttadcct 


or a /-i 4— or4* 4~ a 


/— Y 4— a 4» or 4" a or/"* 

ytaCtytagc 


A^t* rift ft f* o» ^ 

CauyyyCCac 


o» 4- or*- or a ^ 4- 4- 4~ 

c uyugac U U U 


o> a a orot o< 4* 4» of 

uaay uuu uuy 


qd n 


ctcLduy uy y u u 


auuyy uuayy 


uauay tyyu u 


r«a t~ <"T O* 4" OI f~ 4* 

uauy uu uy uu 


dtccoyyyy u 


rf4-o»o>a a4-o»f-4- 

yuuuaauuuu 


70 v 


4— 4" or or 4" f- nnri 

ttygcccccc 


4»orrrot;o i o>a 

uy yy uuauaa 


4™ rrna a or a a or a 

tyyaaydaya 


at*t" or+- 01 4- 4- oror 

auuguc u uyg 


ofr^otaotaota 4*a 
y CuawauaUo. 


auauauay ua 




a ^ ^ 4™ ^ ^ 4* or a 

acacuaauga 


4* aOr/"»4* /^T ^ 4-OT O 


of ^ 4" 4" 4* a a a a a 

gcuuuaaaaa 


aaaaaLCaCa 


^44as4>^4*/ ,, i4* 

aaaaauauc u 


«if aaf or+" 4~ H 

cauaaugu uu 


i nnn 


u aggaaag u u 


4* a a a 4* 4* 4" /r 

UaCydatCCy 


+* or 4" 4* or or or 4- or 

uguugggcxg 


naf»|» 4- or a a or4- 

cauuugaagu 


ggucccgggc 


4~ 4- oror 4-^4" or 

cuuggucugc 


J. J-*r w 


rrrfflt" orofota f" 1 a 
ggg uyy y aua 


a.arTOi4-4-<-Tt~4~a 
day u uuy iLa 


LaaLLLLay u 


a 4- 4" 4» +• oror or a or 

auu ^yyyay 


or 4* oror a oror 01 a or 

g uyg agg c ag 


ota o"of a 4~ o< a r^H 

y acy a uuau u 


1200


tgggcccagg 


agtttgagac 


caggctgggc 


aatatggcgg 


atcccatctc 


cacaaaaatt 


1260 


gcaaagttac 


cgggatggtg 


tcaggcgcta


tagtccagct 


gcttgggaga 


ctgagcagga 


1320 


agaatgctta 


gctgtaggga 


cagttgcggg 


cggatacagt


atggcccagc 


caggcaaggt 


1380 


gacccgtttc 


aaaaaaacaa 


cattggccgg


c uxauaggua 


auaaacacaa 


*rs /-trv n 4" or4" a 

acgaugua 




<210> 142 
<211> 368 
<212> DNA 
<213> Homo sapien 












<400> 142 
ggccgaggtt 


agcatttaat 


f* 4~ a o« 4* or or a a 4* 

CuaCuyyaaL 


a a or 0 4* oror 4- 4- or 

aagcuggu eg 


o»4-o»4- ororo»4- 4- a 

cuguggcuua 


4~ a o« ^ 4* or^ a or4- 

uaCCuguagu 


60


cccagatact 


tgggaggctg 


aggcaggaga 


atcacttgag 


accagcctgg


f—m s-^ m v% U ^ 4— 4 n 1 

gcaauauagu 


120


gagacctcac 


ctctaaaaaa 


— 4- *-f.4.4.1. a A- - 

ytLtttCaua 


aauu taC uug 


4" /■« 4" ^ * a or4* oror 

ucuaaagugg 


fx ft a a a rt/Trr a a 

yyaoaygyaa 


180


attattctgt 


tttcttattc 


ttgcttcaag 


actatgacag 


atttgaaaga 


gaattctaaa 


240 


gcagatttag 


agaacctgct


tctcttctta 


tctcctaatc 


cctaaattgt 


aatttagctc 


300 


ttatctgtat 


tgtgttttgt 


tttggtaaag 


ggatgatttt 


tacattgagt 


tttaaagtag 


360 


aataagaa 












368 



<210> 143 

<211> 540 

<212> DNA 

<213> Homo sapien 
<400> 143
gccgcccggg 


caggtactaa 


gcatattaca 


ccagtgtgtg 


tggcttcaaa 


tagaagactc 


60 


tctatctcta 


ggtatcgaaa 


tggcgaataa 


gaattgcagc 


tctgaaaata 


taatataaac 


120 


acatttttca 


gatagcatta 


tacaaataaa 


aatacttttg 


gatactagaa 


tttggggtta 


180 


gagctgactt 


caataagggg 


aggaaaaaat 


atgtgcaaag 


ggaagggagc 


tcaatttgga 


240 


ataatatatt 


tcaaagtatg 


tggctttgca 


tttttaaaat 


gctaaccaaa 


aaaaatgtga 


300 


gatagagtga 


ccccaaggaa 


ctttttacag 


ccagcctctg 


gataccagat 


atagcattta 


360 


caacttctga 


tatgctagac 


aaattttcca 


tagttttaat 


tttcttaaag 


acatatagtg 


420 


agtgaacacc 


aacactggga 


aggaggtagt 


gttaatttga 


tcaaatattg 


atatttcccc 


480 


actgattcag 


aaagtgcttg 


ctctttctct 


aggatatctc 


aaaacccttc 


caatttttct 


540 


<210> 144 

<211> 1195 

<212> DNA 

<213> Homo sapien 












<400> 144 
aaagactgca 


tctatggccc 


accttctcat 


aacctcaaac 


gcagagtact 


cattctaaga 


60 


aacacatgca 


ttcttgcctt 


aactgaggac 


tgaaatacca 


aacccaacca 


aaaataacag 


120 


aacttttaag 


tgccctgatg 


gaaatgtaca 


gtatgagtta 


catccatcca 


tctatacaca 


180 


catacagaca 


ctcatatgta 


tatacaatat 


atacatacaa 


atgataatag 


ctataccaaa 


240 


aatgtgcata 


tttaatacag 


ttttaaaaag 


ccagtagatc 


atggtataat 


acaatttatt 


300 


tccactagtg 


aatggcacaa 


atggagaaat 


gcaaaggagg 


tgtttcaagt 


acaatgaagt 


360 


catttttcac 


aggtgaaaat 


aaatagtatt 


attccagtgt 


ccaacataat 


gtattctaca 


420 


aaagtttaaa 


atgccagaca 


tttagtactt 


acagaaatag 


aaagttacta 


aagtctgctt 


480 


tatggtctca 


aaaagacact 


catagagttt 


tcaatttttg 


aaaagggggt 


gagaaaaaaa 


540 


ttggtggaag 


caggagggag 


tgtccaggat 


agtttcttta 


gtgtactgaa 


tgtttacctg 


600 


atgtctaggc 


aaccaaaaat 


catgactatg 


ctaaaatgtt 


ccatttacca 


acaaatcctt 


660 


gacctagagt 


actaagcata 


ttacaccagt 


gtgtgtggct 


tcaaatacaa 


gactctctat 


720 


ctctaggtat 


tgaaatggcg 


aataagaatt 


gcagctctga 


aaatataata 


taaacacatt 


780 


tttcagatag 


cattatacaa 


ataaaaatac 


ttttggatac 


tagaatttgg 


ggttagagct 


840 


gacttcaata 


aggggaggaa 


aaaatatgtg 


caaagggaag 


ggagctcaat 


ttggaataat 


900 


atatttcaaa 


gtatgtggct 


ttgcattttt 


aaaatgctaa 


ccaaaaaaaa 


tgtgagatag 


960 



WO 02/068633 PCT/US01/43612

















agtgacccca aggaactttt 


tacagccagc 


ctctggatac 


cagatatagc 


atttacaact 


1020 


tctgatatgc 


tagacaaatt 


ttccatagtt 


ttaattttct 


taaagacata 


tagtgagtga 


1080 


acaccaacac 


tgggaaggag 


gtagtgttaa 


tttgatcaaa 


tattgatatt 


tccccactga 


1140 


ttcagaaagt 


gcttgctctt 


tctctaggat 


atctcaaaac 


ccttccaatt 


tttct 


1195 


<210> 145 
<211> 787 
<212> DNA 
<213> Homo sapien 












<220> 

<221> misc_feature
<222> (344)..(634)
<223> a, c, g or t 












<400> 145 
acatttagag 


aaggttaaaa 


gaaacagtga 


gaaatgtaaa 


cattcaaaat 


gataattgaa 


60 


tctctcagtt 


gtgggaataa 


ttatcagaga 


catgcaactg 


aaaatgtctc 


acctttcatc 


120 


tttttttctt 


aattcataaa 


gttatcttgt 


agaatttgat 


gagaccctcc 


tagtcattct 


180 


caactggggc 


ggtgctgtca 


ccgaatggtg 


tttgagagtg 


ttggggctag 


ggcacatttt 


240 


tggttgtcac 


agcaactggg 


gtggcatttg 


ctgcccagtg 


ccaggaatag 


taacattatg 


300 


aatgccagga 


cagtgtgctc 


agtaaagtct 


tccatccaaa 


aggnnnnnnn 


nnnnnnnnnn 


360 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


420 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


480 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


540 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


600 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 




catcccataa 


aaaaaaacac 


660 


ccccggttcg 


cttgtccctt 


cccccttgtg 


acaaactttt 


tcaaatccag 


aaagctgcgg 


720 


gtaatcatgg 


gcatagggct 


gttcctgtgg 


taaaattgtt 


ttccggccaa 


tccccaaaac 


780 


aaaaaaa 












787 


<210> 146 
<211> 193 
<212> DNA 
<213> Homo sapien 












<400> 146 

tggtcgcggc gaggtacagg 


ccgcctactt 


actgccctgc 


ctgacccagg 


attcttagga 


60 


ggggaaagtg ctatccctgg 


ggccaggcag 


ggagccacag 


ccactcactg 


cattctcaga 


120 
attctatcct


accaccccaa 


agattgacct 


gaaagagact 


ccccttcctc 


agtcatgtag 


180 


aagtccttta 


cat 










193 


<210> 147 
<211> 661 
<212> DNA 
<213> Homo sapien 












<400> 147 
tcatgccgag 


cgcgcgcatg 


agtgatggat 


tggtcgcggc 


cgaggtacag 


agtcctgtta 


60 


tttttctctt 


tggccctatt 


tggctgctta 


tattaatgca 


tcagaacttt 


atgtataatc 


120 


atatggattt


atacgtaaat 


taagaaaaaa 


tgtccatttc 


attcagttca 


tatgttctaa 


180 


acgtattgct 


gatcattctt 


aaatgagact 


ccaggtttac 


attcttacat 


aaagtgcagg 


240 


gatcccgaag 


ttagccccaa 


agatcccctt 


gcctttttca 


gacttgctca 


aatgttacct 


300 


tatcagtggg 


gcctttcctg 


accacacttt 


aaaaacctca 


acacccaccc 


atgggccttg 


360 


tcctccttcc 


cggcttcatt 


ttttggcata 


tacttatcaa 


atgtgaacat 


atgatgcatt 


420 


tgctttattt 


atcatcgatc 


ttcactcact 


ggcatgtaag 


ctctgtgagt 


gcaaagattt 


480 


tcatctagct 


aatcttccag 


aacagtgtct 


ggcacagaga 


aggagctcta 


tgaatatgtg 


540 


ttgaatgaat 


gactatcttt 


gccttgtaaa 


ccccatgcta 


ttggctctct 


cttcaggtgg 


600 


ctgaccactg 


caccccaggg 


catgctggaa 


agacaggagt 


cccaagccct 


cccttctgct 


660 


c 












661 


<210> 148 

<211> 1897 

<212> DNA 

<213> Homo sapien 












<400> 148 
agttttgcgt 


tgcgccttgt 


tgcttgcgtc 


cgtcgtttgt 


ttgcctgtgg 


cttctgccgt 


60 


ctttttgtgg gctcgccgtt 


gcccgtgcct 


gccgctgtca 


cccttggcgc 


ccgcctgggt 


120 


gctggggtcc 


gcgattgcag 


tcctttgata 


gtgttagtga 


ggggggctgt 


cgtgcgtgtt 


180 


gtgtatggtc 


cgcatggggg 


agtcattagc 


atgttgagtt 


gactgtctcc 


cggtccgttt 


240 


aacgtgcgtc 


tggaaggtac 


atttttgtaa 


atcaagtagt 


tggaactaaa 


tccaacactg 


300 


ataattgcca 


tttcaaacac 


tgatctgaaa 


agtgaattag 


aagctgtaca 


atatcatcat 


360 


tagaaattct 


gcatatggct 


aataaatatt 


ccttttaaaa 


ttaatagagt 


ctaaagtctt 


420 


ccacatgatc 


tttacagata 


gagtgggaca 


ctatagaatt 


ctgattatat 


gatttagatt 


480 
ttagggatgt


tttaacattt 


tcaaaccact 


agaaggacat tgggaacaga 


aagtaataga 


540 


gccaacgtca 


cgtggtaatg 


atcaatagtc 


cagttctacg aggagaacaa 


ttttaagctc 


600 


ttcactgagg 


ccaattctgc 


tgtattctaa 


ttccttttag gttcttggtg gtagagtaat 


660 


gagctatgac 


catctctgga 


atactggtga 


ggaaaatggc agcagtaaag 


aaatgaggaa 


720 


aatattacct 


aattaatgat 


aaagttaggt 


ccagtacaga gtcctgttat 


ttttctcttt 


780 


ggccctattt 


ggctgctttt 


attaatgcat 


cagaacttta tgtataatca 


tatggattta 


840 


tacgtaaatt 


aagaaaaaat 


gtccatttca 


ttcagttcat atgttctaaa 


cgtattgctg 


900 


atcattctta 


aatgagactc 


caggtttaca 


ttcttacata aagtgcaggg 


atcccgaagt 


960 


tagccccaaa 


gatccccttg 


cctttttcag 


acttgctcaa atgttacctt 


atcagtgggg 


1020 


cctttcctga 


ccacacttta 


aaaacctcaa 


cacccaccca tgggccttgt 


cctccttccc 


1080 


ggcttcattt 


tttggcatat 


acttatcaaa 


tgtgaacata tgatgcattt 


gctttattta 


1140 


tcatcgatct 


tcactcactg 


gcatgtaagc 


tctgtgagtg caaagatttt 


catctagcta 


1200 


tcttccagaa 


cagtgtctgg 


cacagagaag 


gagctctatg aatatgtgtt 


gaatgaatga 


1260 


ctatctttgc 


cttgtaaacc 


ccatgctatt 


ggctctctct tcaggtggct 


gaccactgca 


1320 


ccccagggca 


tgctggaaag 


gacaggagtc 


ccaagccctc ccttctgctc 


tactccaagc 


1380 


ttttcttctt 


gggtgcattg 


actcaagtca 


ggtagtactt ctctatgtct 


gagcacagac 


1440 


gggctgtgtt 


catgtatttg 


tacatatgtg 


tgaatagaca gagaaactag 


tagcatgggt 


1500 


atgtggggga 


atccatcttt 


tagggagaga 


tttatctact gtttttgtgt 


ttagtctcac 


1560 


ctcagaccag 


gttaagctgg 


ccagggctca 


tagttttcaa agagcaacag 


aaaaaatctg 


1620 


tttagcttac 


attctaagca 


tgttctcttt 


atctttcctg aaagctatcc 


acttttaatt 


1680 


tcatctcata 


ctacagagaa 


aatattattt 


gaaactgata gctttccaga 


aggttactga 


1740 


aatcacttat 


ttttcagtgt 


cttcactggc 


accattcata gtagctaaca 


ttagccactt 


1800 


tccgtgggcc 


tggtgctgtg 


ttaaagtgct 


ttacatatat tatttctttt 


aatccgcaca 


1860 


atgatccttt 


caagtaggta 


ctgttattat 


tcccaat 




1897 



<210> 149 

<211> 254 

<212> DNA 

<213> Homo sapien 



<400> 149 

ccgaggtacc catctagctt ctggggctcc actgacagct gaggacagtc cacacccgac 60
ctggacccca ccccaccctg ggtctgtcca tctcagtccc ggcctgagcc tctgggccaa 120
agccacctct tctgagcagg caggcagagc gaaagactgg gagcagcaga caggggcaga 180
gcacggccca tgagcccacc ctccacttcc cagattggtc agagttacat ggtcaccttc 240 
cctgcacctg cacc 254 



<210> 150 

<211> 1993 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature 

<222> (1822)..(1822)

<223> a, c, g or t 



<400> 150 

aaggatcctt aattaaatta atcccccccc cccggggcgg cccagcggat cgtgccgcgg 60 

cggccgagcg cagctacagg agggtgtcca gaagccacaa gccatggctg tggggaacat 120 

caacgagctg cccgagaaca tcctgctgga gctgttcacg cacgtgcccg cccgccagct 180 

gctgctgaac tgccgcctgg tctgcagcct ctggcgggac ctcatcgacc tcgtgaccct 240 

ctggaaacgc aagtgcctgc gagagggctt catcactgag gactgggacc agcccgtggc 300 

cgactggaag atcttctact tcttacggag cctgcacagg aacctcctgc acaacccgtg 360 

cgctgaagag gggttcgagt tctggagcct ggatgtgaat ggaggcgatg agtggaaggt 420 

ggaggatctc tctcgagacc agaggaagga attccccaat gaccaggttc gcagccaggc 480 

cagattgcgg gtccaagtac cagctgtgcg ttcagctcct gtcgtccgcg cacgcgcctc 540 

tggggacctt ccagccagac ccggcgacca tccagcagaa gagcgatgcc aagtggaggg 600 

aggtctccca cacattctcc aactacccgc ccggcgtccg ctacatctgg tttcagcacg 660 

gcggcgtgga cactcattac tgggccggct ggtacggccc gagggtcacc aacagcagca 720 

tcaccatcgg gcccccgctg ccctgacacc ccctgagccc ccatctgctg aaccctgact 780 

ggtaaacaac tgctgtcaga aaagggctgg gcttgggaag gggaggtgga ggccaggtgt 840 

ccccagacct ctaacccttg cccctagcag cctcttcttt gtggagcctc tcagtgtggg 900 

cagccctcgc atgctggggt cgggccagct ctccccgaaa ggtcttgacc tgaatgatgg 960 

ccggggaagc ctgcgtgtgc ccctttcaga gacggagcac ctgagatgtg ggaggtgcag 1020 

catgttcccc tgggcccctc agaaagtcga gcttggaggc cagcctggat ctgtctctcc 1080 

cttcccctcc tgggaccatt ctacctgtgt tctttgaccc tcggagcagg gacaggcaag 1140 

acaactggca agcttgcagc tgccctgatg gtgcaggtgc agggaggtga ccatgtaact 1200 
ctgaccaatc


tgggaagtgg 


agggtgggct 


catgggccgt 


gctctgcccc 


tgtctgctgc 


1260 


tcccagtctt 


tcgctctgcc 


tgcctgctca 


gaagaggtgg 


ctttggccca 


gaggctcagg 


1320 


ccgggactga 


gatggacaga 


cccagggtgg 


ggtggggtcc 


aggtcgggtg 


tggactgtcc 


1380 


tcactgtcag 


tggagcccca 


gaagctagat 


gggtaccagg 


tggggttagg 


ttcccagagg 


1440 


actgagggaa 


tcctgtacag 


gatgtcccag 


ggtagatggg 


gagcaggatt 


gggacctgct 


1500 


ctgacagctg 


gacacatgag 


ccctggatga 


gtatggtagg 


gggtttgaag 


aatcccctgt 


1560 


ccacctccca 


aatccaggcc 


cggccccctc 


tggcttggag 


agcattccaa 


gcccccaccc 


1620 


cacccctaga 


actgccattc 


ccaagacctc 


tgtctcccag 


ccaaccaccc 


ttggaacttg 


1680 


cctcttgtcc 


tgctggaaag 


atagcagtgt 


tctcctgact 


tcgccctact 


gcatgcagcc 


1740 


aaataaaagg 


tgtgcccagt 


ctaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1800 


aaaaaaaaaa 


aaaaaaaaaa 


anaaaaaaaa 


aaaacaaaac 


tattgcttgt 


gtttccgttt 


1860 


ggattggctt 


tgcgtatcta 


ctttcagtgc 


tttcgtcatt 


tccttcttcc 


tagctgctta 


1920


tcatgcgata 


ggctacactg 


tgctcctttc 


tcaattttac 


ccagttctaa 


tgggaatggt 


1980 


gtgtttagat 


ttt










1993 


<210> 151 
<211> 170 
<212> DNA 
<213> Homo sapien 












<400> 151 
gcgtggcgcg 


gccgaggtac 


atactcatat 


ttataaggac 


ttcctcacta 


ggagagattc 


60 


ctgggttcta 


cagaataaaa 


ttcttggcta 


cttgtcttat 


agctctgaac 


agacttattt 


120 


tcccagagag 


tatgtttatt 


atgtaatagc 


gagttgcctg 


accccccaaa 




170 


<210> 152 

<211> 1394 

<212> DNA 

<213> Homo sapien 












<400> 152 
taggtctgta 


tagaatacga 


ctgaaagatt 


ttgactgttg 


atttcctttg 


aagagattct 


60 


cctgttctcc 


tcttcagagg 


aaagagagag 


gagagaggga 


gagaggagag 


aaaagagaga 


120 


agagagggtg 


acttttcatt 


atatgccttt 


gtaatattta 


atctcctccc 


ttttctcacc 


180 


cccttgagca 


tgtatgcttt 


tcaacaatta 


aaaagattaa 


aaagctctta 


aatagataag 


240 


aatttcttag 


agcaactagt 


aagtacacta 


actgccaatt 


gttttgaaat 


gtagacattc 


300 


tgaaaaatca aaatgattgt 


gccgaatgta 


ttttaaaagg 


ctaaaatatt 


atagcatttc 


360 
aaagtacata ctcatattta taaggacttc ctcactagga agattcctgg gttctacaga 420
ataaaattct tggctacttg tcttatagct ctgaacagac ttattttccc agagagtatg 480
tttattatgt aatagcgagt tgcctgaccc cccaaaaagc tgtgttctat catattaaaa 540
taaggcaaaa tgattacttt cagattaaga aattgtggga ctctagatct tgttatatag 600
tgaagttctt taaaaaactg aggtcttggt tctgaataat agtgggttta cattaattta 660
tttagaattg tcattggggg tatctctgac ctatttttat aaaataatct caatttttaa 720
aataggagta aaatgctcat tggcataagc cagtaataat aatttagtat ttttccaagt 780
atttatagtc aatgtgttgc catgaacttt tttaagggat gtttttaatt ttagaagtgc 840
tttaaaaagc aatatggcat ctggctctgt agaagtagaa aacatggtaa cttcaatgtg 900
atatatttgc ttttttcccc tcttaggtct ttggggtaaa aaaaatccaa agtttactca 960
attttatttc tacatatatt acctacaaat tatagaggtg agacctgctt gctgcctgtt 1020
catacctgtc agtgatacta atacctcccg tctgtacagt gtttcatagt ttccaaaggg 1080
ttctcgcatg gttcctcact tgagcctggt gaggaaatac ctgtaaacat gggtagaagt 1140
tttagtccct tgtaaaactc tggtataata tcaaaaccag gaaattgttc acagattcta 1200
aggattgggg aaaagagaaa aataaacaaa ttgctcagga gtagaattga aaaaaagaaa 1260
aaaaaaaaaa aaaatttggg ggcgggccga aatttaaaca gttggggggg ccaggtggaa 1320
tggaagggca ggaaacggga gatggtgttc caaaaaagcg gggcgggtga aaaagggcgg 1380
gggggtacga gggt 1394

<210> 153
<211> 368
<212> DNA
<213> Homo sapien

<400> 153
gcactgagaa aggatatgga caagtcagtc agcattcaca attaagagaa aaacatctgt 60
gctttggaaa atgttcttca aggatagaga attgtgccct atgtccacca aatttgcatg 120
agatctttat aagattagac agccagtgga taaggcccct tatctttctt catggatggc 180
tgaggaaatt ctccgccttc cctgacatca gctgcataac tgtatttctg cctcgtggaa 240
ataaagtaga tgatcaggca cttgcggttt gttcttaata caagaaagac aatttgattt 300
ttaaaagttt tgatttgtag aataatgtaa gacaatatgt ttctttctac tttggttttt 360
ccattcaa 368

<210> 154
<211> 864
<212> DNA
<213> Homo sapien

<400> 154
ttgatatatt gcattcttga gcattaggct tctaggtgat ttgttaaact catagcaggt 60
tttagtacac agtgctgttt atgacagaaa aaaattttat cctacctctg aaataattgt 120
actttctgtg attcagataa aaactttata gaaactccct aatgaaaata ttgaagcatt 180
aaccagaaaa tgagtcagct ttttgtttcc aaaatgatgc aacaggaaaa cctttaacta 240
cttataatcc cgtatagtca ccatcaccac aaagtattga aaatctgttt tctcttttac 300
taagtgtctg cacggtcact tatgtatacc caaagccaga aagatatttt tatctcaggg 360
aaattccaga aatggaaaca tttttgtgta atattgattc atttctgtct caccaaagat 420
gtgttttcca cgtagcaaag aacatcagcc ccacgttata gggaacaagc gagtcccaaa 480
tcgtaccatc tgctgagcac tgagaaagga tatggacaag tcagtcagca ttcacaatta 540
agagaaaaac atctgtgctt tggaaaatgt tcttcaagga tagagaattg tgccctatgt 600
ccaccaaatt tgcatgagat ctttataaga ttagacagcc agtggataag gccccttatc 660
tttcttcatg gatggctgag gaaattctcc gccttccctg acatcagctg cataactgta 720
tttctgcctc gtggaaataa agtagatgat caggcacttg cggtttgttc ttaatacaag 780
aaagacaatt tgatttttaa aagttttgat ttgtagaata atgtaagaca atatgtttct 840
ttctactttg gtttttccat tcaa 864

<210> 155
<211> 179
<212> DNA
<213> Homo sapien

<400> 155
gcggcgcggt gttatggagt agcgtggttc gcggccgagg tacatgtttt taaaaaatga 60
ctacatgttt cacctggtcc tatttttgct atttggacca tacttttaag atgaattgat 120
cttacataca tgttaagtct gatttatctc cccacatttt taaacactaa atgaagctt 179

<210> 156
<211> 1849
<212> DNA
<213> Homo sapien

<400> 156
gcttgatacg ctcctaagga atttgccctc gagcaagcaa ttcggcacga ggctcgaacc 60
cctgacctca agtgatcagc ccgcctcagc ttcccagagt gctgggatta caggtgtgat 120
ccactgcacc


cggccggcat 


tatgattttg 


tgtactcttg 


aaatggttat 


ctttgtggat 


180 


gatttttttt 


ttttaagctg 


aaacttacct 


catgaataac 


ttgattaaag 


tagtaggtga 


240 


ttaaaatttc 


aatagaatca 


aatgagacaa 


aaattttaaa 


ctgactcatt 


tgagtttcaa 


300 


ctttacagtc 


attgaccata 


aagcacacta 


aaaatgtaag 


ttatttttaa 


atacatctga 


360 


aataaaaata 


cttactaaaa 


aggaagaagc 


cgaagatgta 


tatttagacc 


agcacacaat 


420 


tttgatttca 


attagcctta 


ttctaatatt 


tagcttttag 


atctttcata 


cacattttca 


480 


cgtactttgc 


aattgagacc 


agaaagactt 


gtaggtcttt 


ctgcagaatg 


agtgggtcct 


540 


tgcaaagtga 


gtgggaaact 


tactcctaga 


tcagaaatgt 


ttgcctctct 


gagtaaaatg 


600 


tttctttcag 


atgagccata 


gagggggcac 


cttttactca 


acttttcttt 


gttttgaaac 


660 


tttgtttccc 


atactgtttt 


cagccttttg 


tttataatta 


gaaattgtga 


gaagcttcat 


720 


ttagtgttta 


aaaatgtggg 


gagataaatc 


agacttaaca 


tgtatgtaag 


atcaattcac 


780 


ttaaaagtat 


ggtccaaata 


gcaaaaatag 


gaccaggtga 


aacatgtagt 


cattttttaa 


840 


aaacatgtac 


ttggtctttt 


gtgtgtgtct 


gttttattcc 


attagaataa 


atgtgtcctt 


900 


gatgtaaatg 


caaagcattt 


cttcctgatt 


aaattgtaga 


tgtagacttt 


acaatataat 


960 


tcaataataa 


aaagtaatta 


acctctaaaa 


aaaaaaaaaa 


aaaaaaaaaa 


aaaaaaacac 


1020 


ttgttgggcg 


cggcgcgggc 


ccggagaaaa 


gtttttaaaa 


cccttcgtat 


ggcgcgagga 


1080 


ggggccccag 


gtaggggaac 


ggaacacagg 


ggccacggcg 


gtgtagcaag 


agggaaaacc 


1140 


cggtggcgaa 


gaaaaagggc 


gtggattctc 


acccaataaa 


aagcgcgccg 


ggggggaacc 


1200 


ggaaggcgtt 


tgggaaacaa 


gactctcaga 


aagagggaga 


ggcgtcagcg 


gaggcgcaca 


1260 


ataaaggggg 


gactcctcac 


aacaccgggg 


aggagctccc 


gaagaggaga 


cgccccgaaa 


1320 


tagacaacaa 


ttacagcccc 


cgggcgcgcc 


gggggcagat 


accagaagac 


gaagcacgag 


1380 


acgaagtagc 


acaagaagaa 


aggacagaca 


gaagcgagac 


gagaggacag 


aagggaggag 


1440 


agagaggcgg 


aggcgagggc 


gaaaacaagg 


ggacgacagc 


aacgagaaga 


gaaagaaaaa 


1500 


cacgacggag 


gaacggggcg 


gaaaaaggac 


gagaaagccg 


agagcaggcg 


ggcgggggca 


1560 


cacgggggaa 


acgaagagca 


aggaaagaag 


acaacaggag 


aggaggggag 


ggaagcgagc 


1620 


gaagagcgta 


gagacgccga 


gcaacaagaa 


gctagagaca 


gcagtagaca 


cggacagaca 


1680 


gacacggtga 


tggtagccgg 


ggcgggggga 


catcttgggc 


gcgaatgctt 


cgccggggaa 


1740 


cagacagcgc 


agggcgagca 


gagcgaggaa 


acgcagggaa 


cgacgcgaca 


gacaggaggt 


1800 


cagaagaagg 


gaagatgagt 


gcagcgggaa 


gaccaacgga 


gaaggagag 




1849 
<210> 157

<211> 903 

<212> DNA 

<213> Homo sapien 

<220> 

<221> misc_feature

<222> (139)..(139)

<223> a, c, g or t 



<400> 157 
ttgtttgttt 


tttttctttt 


tttacatttt tttttttttg 


tggacgaaac 


attcacacag 


60


aaacagctag gaccagatta 


cgccagcttc atgagacctc 


tcctatctgg 


gcacgttgag 


120


ttggctgact 


ctgggagcnc 


aggctgttgc ttcccagtct 


ggtggtgaat 


cctccatggt 


180


ctggttgagt 


ccttttcaac 


tagtttttgt gttgtttttt 


gaacctcaag 


tcactgactc 


240


cttcagcagt ggtttaagga 


agtatcttaa tttttaaatc 


acatgttgac 


catgcagcca 


300 


attgttggcc 


cgtagagtga 


acaaaaccag accacaagat 


tgagggcttg 


gagctggaga 


360 


aagggaaaag 


aaaaagcagg 


ctgtgactct ctggggaaag 


aactgaaaga 


tgacactagg 


420 


aattctcaaa 


gcgagaggaa 


aaggaaaggc cctttttcgg 


aaatgacctc 


tgataaacac 


480 


accctccagg gttacacctg 


ccacfcgtgtc tbccacggac 


agacaagctig 


CaCUttayCa 


540


gtcctgaata 


cctagagact 


tccttaacag agagtgggga 


atctcgtcat 


cttgcatggg 


600 


gatgggagct 


cgaagggaga 


acctcagcct tccagaaggt 


tatataaaac 


cagttgagaa 


660 


tttccctaag 


aatggagcag 


tggacaaaca attgttattg 


taatccaaat 


acatgagtct 


720 


acctacataa 


tggagaaatg 


ctaacttaca gaaagcgtca 


ggcttggtgt 


tctcgaactg 


780 


gttttaattt 


ccttttataa 


aattacagcg aattatacaa 


ttacattgat 


taattgtaat 


840 


cagttctgag 


tctagataac 


aaaacacaac acacacaaac 


aaacaaacaa 


cctctcctca 


900 


aac 










903 


<210> 158 
<211> 368 
<212> DNA 
<213> Homo sapien 










<400> 158 

gcggcgcccg ggcaggtaca 


gtggcaacag cattagacta 


agtggaacat 


cccagcaggc 


60 


tgctttagaa 


tccgctcatt 


tgactagata cgatgtaatt 


ggctgtcttt 


aaaaaacgcg 


120 


cacacacaca 


caatctgata 


ggcatatctc atgcccattc 


aatatggaat 


gttcttcgct 


180 


tgctgaattt 


aagcctgtat 


tttaaggttt gtggttcctc 


ggccacaatg 


ggtgatgtca 


240 
ctgatagaac gaagctgagt ttccaagggt tggggctgtg caagagtaaa cactagagct 300

tgagttgtta tccagctggc aagcacggaa gtctttgaag aatgtaatgt aaaaagggaa 360 

aagaatgt 368 

<210> 159 
<211> 1548 
<212> DNA 
<213> Homo sapien 

<400> 159 

gtcgcggccg aggtacatag acacaggaca attaataatt tggaaaacaa aagacttact 60 

tatctccata tctgctaatt gtctcccaat ctccttaaaa tgcacttatt agcaatcatt 120 

tttcaccacc catttaccca aacaacagga caaagactca atttcctatt ttatacaaaa 180 

catgataagt cagccaagta ggttctggac cggacaacaa gggagatatt aaattatagt 240

atttatataa aagtggccca ttcgtgtgat acaaaacgtc tcattatgtg gaccaagaaa 300 

catataaaat atatcaatat ataagttgga aaaaataaca aaaaagcaca cacattaata 360 

aataatcata ttacacacac ccacatctat attctcttat atacacgcac acacaactgt 420 

gtgtcttaac aacaaactct ccattttata aaatatctca ctgtatctct ttataaccac 480 

aagaaagatg gaaacacaat aaacaccaaa aacaaagaca cacagaaatc atttgtgcgc 540 

ctatatataa taatcacact cagtatattt aaccaacact tcccccccca tatcatctaa 600 

tttatgtagt actctataat gtatcatcta gtatattatg tagaaaagtg gcgccccata 660 

tattatacag ccgggcgccc acgtgtatat caaacaacag ctgttgtcgt tctgcccggg 720 

gagcacaaaa ggtgtataac aacaaatgat tatattcttc acccgaggaa gaagataatg 780 

agaagggagt aataataacc agcgaagaag gggcgggggg acataataac accgctaatc 840 

aacaccaaag aacgcgggcc aaacgaaaca acaacacaaa cgagatgttg tgccacacct 900 

cagtaacacc tacatgtcag cgcgcagcga caaaacaagg gcatgaccag aagaaacctg 960 

aaaaaactgc ggcaaaggga accacaccac cgacacacac accccaatga ggaaagagaa 1020 

taaaaaacac atagttgagg acaccaaaaa aaggacagga ggaaaaacac actgtggggt 1080 

gtgcaaaaca cacgttgtcg taacaacacc gcaggagatc acaaaagaaa acaccccaag 1140 

acacaacaaa atataagcga acaccacaac aacacaacaa cgaaacaaaa caagaaaaaa 1200

caaaacacac aaccgaaaca aacaaccaaa acgaccacca gaagcacaca acacgagacc 1260 

aggacaacac cagacgcaga cagacacgag atgatgcaca cgcgcatcgg ccgagccagc 1320 

aaacggcacg ccaacaagca agacggccag cggagaacac gagcatgtcg aagtaggaca 1380 
gcccgccaag acgacaaaaa accgaggaga agagaggaag aacacagaaa ccgcgaggaa 1440
aggacaacgc aggaggcgag ataacggcgg acaagcgcaa gaagggagca gagaagacag 1500 
aaggcaagac acgaaacaaa aacacacaac gaacaaacac gcgagagg 1548 

<210> 160 

<211> 552 

<212> DNA 

<213> Homo sapien 

<400> 160 

agaagactga ctcatatagg gcgatggtca ctagatcatg ccgagcggcg ccattgtgat 60 

ggatctagct ttggctactg ccggtagtgg acaatatggc acatggaaat taaaaagtcc 120 

ataaacgtgc cctcctaaca cgagaataag aaaggtggct gaagtagata atttcagtga 180 

cggaggggat gaaatatttt ttggttaatt gatgtaatga tgactcacta tgccttattt 240 

cctattttta aaaacacaga atgagcaagt cattcctgaa caaaaattta ctgtgtgtat 300 

aacatacacc tcaaaatgaa ttttaaggga acatattact aatcaaataa cacagtttat 360 

gctttttcaa tttccacaaa ttgttaatta tgatacttaa gggaaccctt acaatatata 420 

acaagtcatt tcaatattat tcatcatcct taacttctga aagtttggtt tatgttatct 480

tatctagaaa gaaaactact tacaaatctc attttcccac aaaattaatt caacatccaa 540 

ccttaaaaat aa 552 

<210> 161 

<211> 3937 

<212> DNA 

<213> Homo sapien 

<400> 161 

tgtagctggg attacaggca tacaccacca cacccagcta tttttttgta tttttgttag 60 

agacaggttt ctccatgttg gtcaggctgg tctcgaactc ccgacctcag gtgatccgcc 120 

cacctcggcc tcccaaagtg atgattttat ttttattttt aattgtatta atctgcatct 180 

agataatgac tttgtgaagg gtgtgtgttg tgtgcaaaac ttaaggtatt ggttgagagt 240 

taaatacatt atttttataa tatgttgggt atagtctagt tactaatgat tttttttaag 300 

tacttttata aaaagttcat ttttaaaatt gtttgttttt aaaagccaat atacgttgca 360 

gaattaggaa cagtatttat atttatttac acaagacatt gtgccatagc atcctagtaa 420 

aacaccttca tgaatgagta atgttatctc ccagaattac attaaaatta tttctaaaaa 480

gtagcaaagt cattaccttt tgcttttaat gacccacacc tcaccagctc ctggtctttt 540 
cttcactgtt gcccttattt tgaggcaatt tttcttaaaa tatgactttt atgcaccaca 600

tttagtagag gcagtgacat cagtgatctc agtacccatc agctgtcccc ctcctctgcc 660

cttcttcatc tcttctacct tgtgaccatt tcccttaccg gttcatgttc tcctttatct 720

ctgcttttct ttcttagcca ggatacttcc ctcacaactc tcactcccaa attcttttag 780

atatacattt ttctggatat tggctgctga aatctgaagc tctggtaaag ttcctagtat 840

cagagatcaa tcctggagga ggcctagttc actattagat tacaaagact cctcacaaag 900

taaaggaaaa tcaccttcaa aaccacaacc ctttatgttg tcaagtctaa tatgagtgtt 960

tttacgaagt atttctttct acccattgtt caagaatgta aatgtaaaaa aaaatacaag 1020

agagttgggt agatatgcat gcttgaggaa acttgctttt actgttttcc tacttgtatc 1080

cccagttcag ttgaatttac aaggacctac aagatggtca tgtttgtctt ggtatgtgct 1140

accccaattt tagtgtttct ttctttattt taaatcagta attattcagt tgattgttta 1200

tactatataa tgaagtaaca aaaacatttt ggtttgtatg ttttaagtaa cagttgtgca 1260

aattcctctt gtttgttagg tgctcccttt gaatattttg tgaactgtgt cagagggaga 1320

ggggtggtgg ctaggaagag ggtcagaaag aagctagagg gaggtcagga gaagggtaac 1380

agggaggatg caaagcagac atctaccctg gtcaccccag gatcaggata tctgtccttg 1440

tttcatgttg aattcaaaaa ttggatctca cttaggcttt gaaggtgaca gccatctctg 1500

atagctgagc ataagtaaag aaaggtggag tgccgatgca gaaaggaaaa tattcagctt 1560

tcctctctta gatcgcactt tgaagatggc cttttggaga caatctgaca ggttaaaaca 1620

ggaactgttg gaattattct agctgtaact acctattggc tatgtgttga ttgatcctag 1680

aaagaaaaaa taatttttca ttttagatct tgattgaatt taagatgtat ttatatgcct 1740

acaaaaggtc tgtcttgtaa ctgttgtata aaataaacct aatctatggt ttcattttta 1800

atctaaaaaa agttgtgcct taacaatagg gcattgtatg ttaataaggg aaaacaacct 1860

ttttagtaga tgggggaaaa taggaacttt ttgccattaa aacttaagtt cttttgatgt 1920

ttttaatatt atagttgggg gagattcatt aaaattaaat tgaaataaaa ttatttttgc 1980

ataacctagc atttacaact aaagtatgtt ttttataaga actggcatct tgatgtatat 2040

aggtctgaaa taatatttca tcttttgatt tttaatttta ataatattag accaggatag 2100

atcacagttt tacaaatctt agttttaaat aaattatttc agtgtgctgt tagtcctcta 2160

cagtcatttt ggtttaaaaa gtgactattt atttatggta gcatatcaat aatttattaa 2220

tgttaaaaaa tactgtgtat gacattacaa accagaacag ttcctggggg agaggattct 2280

aattgattgg cagttctgag agggcaagaa gaatggaact ttatacttca aaaggaggtt 2340
ttggttttac caggtactgc ttatgtaaat cgtttatttt tatttcatca aagcctggca 2400

agtatatgca ttccaattta ccattggcaa agctttattt atttttaagg ttggatgttg 2460

aattaatttt gtgggaaaat gagatttgta agtagttttc tttctagata agataacata 2520

aaccaaactt tcagaagtta aggatgatga ataatattga aatgacttgt tatatattgt 2580

aagggttccc ttaagtatca taattaacaa tttgtggaaa ttgaaaaagc ataaactgtg 2640

ttatttgatt agtaatatgt tcccttaaaa ttcattttga ggtgtatgtt atacacacag 2700

taaatttttg ttcaggaatg acttgctcat tctgtgtttt taaaaatagg aaataaggca 2760

tagtgagtca tcattacatc aattaaccaa aaaatatttc atcccctccg tcactgaaat 2820

tatctatttc agccaccttt cttattctcg tgttaggagg gcacgtttat ggacttttta 2880

atttccatgt gccatattgt ccactaccgg cagtagccaa agctagctgt ttcagtccca 2940

cagaagagac agtgctctgc catgatgaca gggcactgct agggctggtt tttcttgttt 3000

ttcccttttg gcagtgtgga cttcaggaac tagatgtata tgcacaaggg attgagttta 3060

cactaaaact aggaaatgga gttttcaatc tatgttcttg cctcttcata cttttattta 3120

ttttttgtca tcctgcctta tactgggcta acaatgagat aaaataaaaa tacctttgaa 3180

tactcttttc cctttcatgc atttaaagcc atggaggaac tagaccatta gctgttgccg 3240

tcacatgctt agacaccagt ttacttagcg tgttatgacc ttcctcaccc atactaccaa 3300

atttaaatgg gtcccgactt caccctctgg aaggaagtaa actcttctct ccccatggtt 3360

tcagagcagt ttttacctgc aagcaccatc tctgtatgtg ctcttactag attatacagt 3420

tcttgagagg gattgcatct tggtgttttt gtatttccac ctcaccccca gcacatagcc 3480

cagtctcttg cacaaattaa gtacttaatg tgtgttgagc taaattgaat aaaggattat 3540

tagcattagc atattttgtg ccttggttgt ataagctggt tgtttgtttt gttacctttg 3600

caaatattta tgattatcac ccccccacat actaaattgt ttttaaaagt tttgcctttc 3660

cttcagatac taccccaggc aatttgctgt agataatgtg attgcttcca atgacataat 3720

tatcccaaac tctctgcccc ggatatactt tgccaaacga aatttgaatt ctctgaataa 3780

attggtcatg tctaaaaaaa aaaaaaaaaa aaaattgggg gcgggcgaaa aattttaaac 3840

atgtgggggg gggcaatgaa agaaggcagg tccaagacgg ggaagggggt cacaaaaagc 3900

ggggggttga aaaggggggg tggtataggt aacgggg 3937



<210> 162 
<211> 852 
<212> DNA 
<213> Homo sapien



<400> 162 
caaccaccca aac acra tacc accatgccca gctaattttt atatttttag tagagatggg 60

afctttaecafc attggccagg ctggfcctcaa actcctgaac tcaagtgata acacccacct 120

fccrcrcefcccca aaatactaaa attacaggtg tgagccacca cactgggcca atgcttaata 180

ttttaatgta tctcaacaat aaaaccaaga agaaacaaag ccttttgact tgttagaatg 240

tattaacrtacr tattttaaag aaactttata gttgtgacat tgaaagactg ttggggtggg 300

crcrcr a aaaaa a tttttacttt ccatcttaat gtaaccttat gctattctgt atttttactg 360

tatattgctt ttacaataaa tataaaatga aaatgtttat gttgacaaaa agaacaaaaa 420

acaacaaaca acaaaaaaca aaaggctggg ggtgtcacac ctgtgggcca aaagctggtt 480

t c c c t gocrcrcf fcacracatttcf gttttatccc ggccccacaa ttccccaccc aaatattacc 540

gggagacaac gggaagaacg acacaacaca caaaaagaca caacacacaa aaccaccaca 600

cagcaacgcc cgctcatcgg aggcagcgca caagacgaga gcagaaggaa aagggacaac 660

aaaaaaaagc aagtagcacg atcacaacac agagccacga caagaagaga aggacaatga 720

cgaaaacgag cagcagcaca acracacaaac aagacaaaag caagaaagac agaacacgac 780

agaagacaac aagagacgaa acaaccaaaa agccacacaa aaaagcgagc gaaaacaaga 840

accaactaac ac 852


<210> 163 
<211> 685 
<212> DNA 
<213> Homo sapien 












<400> 163 
caccgatgat taaccagtac acracoctcrtt ccactagatc atcraccracfccr cffctcgcaatg 60

ttatagtagt gcgcggctga catacttcat cfcacfcfcttfca ttattattat tgtgtttttt 120

aaccaaaatg gcatttgttt cr a e fc a fc e a fc fc crfcccracifcafca tatttattat gttttacaaa 180

catgagtata tatgtatgta tatatataaa ccaaaethat afcafcafcacraa cr tic aacrcrcrc a 240

tgtatactag atattttaaa gagttattta tcaagggaaa aagatgtgtg ttataaagta 300

aacagagtct atattttcta tataatgtag atagtcaaac atagcttata aattatagga 360

ggttttggtt ttctttttta ttattaaagg aaaaaaagaa gaaaaaaaaa gacaatgaat 420

gcaactgttc ttagtgtttt taaggccaaa ctactgtgga agctgggagg cggccctccc 480

tgcgggcctc cagcagccct gtgcctgccg ctaggagctc cagagctgat gcccgttgtg 540

atctctgcga tgctcgatgc tcggagccag cggtcagcat cactcagcca gctggcctgt 600

gccgctctga cgtggcttcc agctgttctc cgcaatttgc attggtggga taaaggaatg 660

aaaaggataa ataaagattt aaaag 685


<210> 164 

<211> 2396 

<212> DNA 

<213> Homo sapien 












<400> 164 
cggccactga attcccttgc ggccgcagaa tttttttttt ttttttttgt attttctttt 60

aaatctttat ttatcctttt cattccttta tcccaccaat gcaaattgcg gagaacagct 120

ggaagccacg tcagagcggc acaggccagc tggctgagtg atgctgaccg ctggctccga 180

gcatcgagca tcgcagagat cacaacgggc atcagctctg gagctcctag cggcaggcac 240

agggctgctg gaggcccgca gggagggccg cctcccagct tccacagtag tttggcctta 300

aaaacactaa gaacagttgc attcattgtc tttttttttc ttcttttttt cctttaataa 360

taaaaaagaa aaccaaaacc tcctataatt tataagctat gtttgactat ctacattata 420

tagaaaatat agactctgtt tactttataa cacacatctt tttcccttga taaataactc 480

tttaaaatat ctagtataca tgccttgact tcttatatat ataagtttgg ttttatacat 540

atacatacat atatactcat gtttgtaaaa cacaataaat atatactcga caatgacagc 600

caaacaaatg ccattttggt taaaaaacac aacaacaaca ataaaaagca gatgaagtac 660

tataaaagca ccaggcagca gacagaaagc cacatttgct agaaacttct ccctcccctg 720

ggcaaggcac agggcagaat actaaatacg tccaggtgcc tgaaagagaa ggaaggcagc 780

aaaggaatgg gtcagatcac aagcttttgt tttgtttctt agcctgggat tagaccaaga 840

atcacaagta agtcattgcg tttataagga aaaaccaagg ggctcattca acagcttagc 900

ccttgtggtg gctgcagggg acagtagaga cctggaaggg gaaggagaga gacagtgacc 960

tggtgacaac tccatgacta ttgtctagcc cctgcctaca tctgcccagg caagcttttg 1020

attgccacac taagcatcaa gccattgcat ataatatgtt cttttcatga ctctattcac 1080

tgctggtgta aacagaaagg agtattcagt agtgcaacac tttgggaggc tgaatgagag 1140

ccacagtttc ggtggccccc tcaagggctc cagcaaatgg ggactgtgac gtgggggaga 1200

aaaaggccag tgggccattg ccttccactc ctgagacagt accggcttca cctccagatg 1260

ccactgggaa cactgagccc atcacctttt aaagaagcgc aggaggtctt gactcttcct 1320

atgagaagtc ccagaagccc cagtcagtct gggtgggggg tcccctgaaa ctcagagcac 1380

gctccagctg ctggctcctg tgcctggcgg ggagaaggtg gagaggcggg catgcctgcc 1440
gctgagaaag acataataac gctccagctg cacacccctt ttcttccacg gccctcctct 1500

ttctccccaa aaggaaacta aaatgtgggg ttctgatcat tgatttttaa acaagctccc 1560

caatgtcatg gcctctgctc tgtaaactcc aaggctctga tggccaaaaa cgtaggcggt 1620

ggtggagcag taggcggtgt caaggctaga tttgtattaa tatctaggag tggggtgggg 1680

atgcaggtgg agggtgagcc aagtagacca agcctcagga aatcccagag gtgaagattt 1740

gatgcacatc ccccatcctt gccaaataaa atacaagtta gaggtcattg ttcacattac 1800

agtcacaatg ggacatttta aacagctcgt tttctgagta cattcaagac ataattgaat 1860

ttattttaaa aaatggattt cccggctggg cacggtggat cacgcctgtc atcccagcac 1920

tttgggaggt caaggaagcc ccttcttcac aggacatcct ggtgttcctc actgggcagg 1980

aggagatcga agccatgagc aagacgtgcc gagacattgc aaagcacctc ccagacggct 2040

gccctgcgat gctggtcctt cctctgtacg cctccctgcc ctatgcacag cagctccgag 2100

tcttccaagg ggccccaaag ggctatcgca aagtgatcat ttcaaccaac atcgctgaaa 2160

cctccataac cattacagga ataaaatatg tagttgacac gggcatggtt aaagcaaaga 2220

agtataaccc tgacagtggt cttgaggtgt tagcagtgca gcgggtatcg aagacgcagg 2280

cttggcagcg cacagggagg gctggcagag aggacagtgg catctgctac cggctctaca 2340

cggaggacga gtttgagaag tttgataaga tgaccgtgcc agagatccag aggtag 2396



<210> 165 

<211> 11 

<212> PRT 

<213> Homo sapien 

<400> 165 

Met Arg Tyr Leu Pro Gly Leu Ser Ala Arg Ile
1 5 10

<210> 166 

<211> 45 

<212> PRT 

<213> Homo sapien 

<400> 166 

Met Ser Ile Pro Arg Ala Glu Ile Ser Leu Leu Glu Ser Phe Gln Leu
1 5 10 15

Thr Ser Thr Val Ala Thr Ser Glu Ser His Lys Ser Asn Gly Ser Cys
20 25 30

Arg Lys Pro His Leu Leu His Cys Pro Arg Ile Asn Gln
35 40 45


<210> 167

<211> 37

<212> PRT

<213> Homo sapien

<400> 167



Met Ile Leu Gly Ser Asp Asn Gly Ile Arg Arg Ile Lys Tyr Leu Gly
1 5 10 15

Ile Gln Tyr Tyr Ala Cys Ser Phe Phe Gln Ile Val His Gly Gly Gly
20 25 30



Gly Cys Val Ser Gly 
35 



<210> 168 

<211> 82 

<212> PRT 

<213> Homo sapien 

<400> 168 

Ser Leu Ser Val Ala Gln Ala Arg Val Gln Trp Arg Asp Pro Gly Ser
1 5 10 15

Leu Gln Pro Leu Pro Pro Gly Phe Lys Arg Phe Leu Ser Leu Ser Leu
20 25 30

Pro Ser Ser Ala Gly Tyr Arg Arg Ala Pro Pro Pro Cys Pro Ala Leu
35 40 45

Leu Tyr Phe Ala Val Glu Thr Gly Phe His His Val Gly Gln Ala Gly
50 55 60

Leu Glu Leu Leu Thr Ser Gly Asn Pro Ala Pro Pro Arg Pro Pro Lys
65 70 75 80



Val Leu 



<210> 169 

<211> 103 

<212> PRT 

<213> Homo sapien 

<400> 169

Met Ala Ile Phe Ser Ala Leu Ser Gln Leu Leu Glu His Gly Leu Asp
1 5 10 15

Leu Glu Thr Ser Asn Lys Asp Phe Thr Ser Ile Pro Ala Ala Cys Trp
20 25 30

Trp Val Ile Ile Ser Met Thr Thr Val Gly Tyr Gly Asp Met Tyr Pro
35 40 45

Ile Thr Val Pro Gly Arg Ile Leu Gly Gly Val Cys Val Val Ser Gly
50 55 60

Ile Val Leu Leu Ala Leu Pro Ile Thr Phe Ile Tyr His Ser Phe Val
65 70 75 80

Gln Cys Tyr His Glu Leu Lys Phe Arg Ser Ala Arg Tyr Ser Arg Ser
85 90 95



Leu Ser Thr Glu Phe Leu Asn 
100 



<210> 170 

<211> 131 

<212> PRT 

<213> Homo sapien 

<400> 170 

Arg Thr Ala Arg His Asp Tyr Ala Ser Cys Arg His Leu Met Val Phe
1 5 10 15

Ser Thr Arg Leu Thr Leu Lys Arg Cys Tyr Arg Glu Met Val Met Leu
20 25 30

Leu Val Phe Ile Cys Val Ala Met Ala Ile Phe Ser Ala Leu Ser Gln
35 40 45

Leu Leu Glu His Gly Leu Asp Leu Glu Thr Ser Asn Lys Asp Phe Thr
50 55 60

Ser Ile Pro Ala Ala Leu Leu Trp Val Ile Ile Ser Met Thr Thr Val
65 70 75 80

Gly Tyr Gly Asp Met Tyr Pro Ile Thr Val Pro Gly Arg Ile Leu Gly
85 90 95

Gly Val Cys Val Val Ser Gly Ile Val Leu Leu Ala Leu Pro Ile Thr
100 105 110

Phe Ile Tyr His Ser Phe Val Gln Cys Tyr His Glu Leu Lys Phe Arg
115 120 125

Ser Ala Arg
130



<210> 171 

<211> 23 

<212> PRT 

<213> Homo sapien 

<400> 171 

Met Val Ala His Cys Ser Leu Pro Val Pro Val Thr Leu Pro Asn Cys 
1 5 10 15



Pro Val Ala Cys Arg Leu Ala 
20 



<210> 172 

<211> 57 

<212> PRT 

<213> Homo sapien 

<400> 172 

Met Asn Gly Arg Gly Leu Ala Arg Gln Gly Cys Glu Ser Gly Asn Ala
1 5 10 15

Phe Phe Thr Pro Met Asn Phe Cys Leu Ile Leu Thr Thr Glu Gln Glu
20 25 30

Cys Ser Glu Thr Glu Val Gly Val Thr Asn Ile His Phe Pro Phe Ser
35 40 45

Ser Cys Ser Asn Gln Tyr Phe Lys Lys
50 55



<210> 173 

<211> 50 

<212> PRT 

<213> Homo sapien 



<400> 173 

Met Ser Gly Glu Arg Thr Asp Tyr Ala Pro Leu Asn Asn Lys Leu Cys
1 5 10 15

Ser Arg Val Arg Ala Val Ser Ser Glu Ala Leu Leu Thr Glu Thr Ala
20 25 30

Thr Phe Lys Pro Asn Gln Arg Lys Tyr Leu Trp Asn Ser Leu Pro Gln
35 40 45



Phe Gly 
50 



<210> 174 

<211> 31 

<212> PRT 

<213> Homo sapien 

<400> 174 

Met Val Thr Asn Leu Leu Thr Ala Gln Cys Gly Trp Met Val Ala Ala
1 5 10 15

Arg Cys Phe Cys Cys Leu Asn Leu Leu Asn Cys Leu Ile Phe Ser
20 25 30



<210> 175 

<211> 456 

<212> PRT 

<213> Homo sapien 

<400> 175 

Met Glu Pro Arg Cys Pro Pro Pro Cys Gly Cys Cys Glu Arg Leu Val
1 5 10 15

Leu Asn Val Ala Gly Leu Arg Phe Glu Thr Arg Ala Arg Thr Leu Gly
20 25 30

Arg Phe Pro Asp Thr Leu Leu Gly Asp Pro Ala Arg Arg Gly Arg Phe
35 40 45

Tyr Asp Asp Ala Arg Arg Glu Tyr Phe Phe Asp Arg His Arg Pro Ser
50 55 60

Phe Asp Ala Val Leu Tyr Tyr Tyr Gln Ser Gly Gly Arg Leu Arg Arg
65 70 75 80

Pro Ala His Val Pro Leu Asp Val Phe Leu Glu Glu Val Ala Phe Tyr
85 90 95



Gly Leu Gly Ala Ala Ala Leu Ala Arg Leu Arg Glu Asp Glu Gly Cys 
100 105 110 



Pro Val Pro Pro Glu Arg Pro Leu Pro Arg Arg Ala Phe Ala Arg Gln
115 120 125

Leu Trp Leu Leu Phe Glu Phe Pro Glu Ser Ser Gln Ala Ala Arg Val
130 135 140

Leu Ala Val Val Ser Val Leu Val Ile Leu Val Ser Ile Val Val Phe
145 150 155 160

Cys Leu Glu Thr Leu Pro Asp Phe Arg Asp Asp Arg Asp Gly Thr Gly
165 170 175

Leu Ala Ala Ala Ala Ala Ala Gly Pro Phe Pro Ala Pro Leu Asn Gly
180 185 190

Ser Ser Gln Met Pro Gly Asn Pro Pro Arg Leu Pro Phe Asn Asp Pro
195 200 205

Phe Phe Val Val Glu Thr Leu Cys Ile Cys Trp Phe Ser Phe Glu Leu
210 215 220

Leu Val Arg Leu Leu Val Cys Pro Ser Lys Ala Ile Phe Phe Lys Asn
225 230 235 240

Val Met Asn Leu Ile Asp Phe Val Ala Ile Leu Pro Tyr Phe Val Ala
245 250 255

Leu Gly Thr Glu Leu Ala Arg Gln Arg Gly Val Gly Gln Gln Ala Met
260 265 270

Ser Leu Ala Ile Leu Arg Val Ile Arg Leu Val Arg Val Phe Arg Ile
275 280 285

Phe Lys Leu Ser Arg His Ser Lys Gly Leu Gln Ile Leu Gly Gln Thr
290 295 300

Leu Arg Ala Ser Met Arg Glu Leu Gly Leu Leu Ile Phe Phe Leu Phe
305 310 315 320

Ile Gly Val Val Leu Phe Ser Ser Ala Val Tyr Phe Ala Glu Val Asp
325 330 335



Arg Val Asp Ser His Phe Thr Ser Ile Pro Glu Ser Phe Trp Trp Ala
340 345 350

Val Val Thr Met Thr Thr Val Gly Tyr Gly Asp Met Ala Pro Val Thr
355 360 365

Val Gly Gly Lys Ile Val Gly Ser Leu Cys Ala Ile Ala Gly Val Leu
370 375 380

Thr Ile Ser Leu Pro Val Pro Val Ile Val Ser Asn Phe Ser Tyr Phe
385 390 395 400

Tyr His Arg Glu Thr Glu Gly Glu Glu Ala Gly Met Phe Ser His Val
405 410 415

Asp Met Gln Pro Cys Gly Pro Leu Glu Gly Lys Ala Asn Gly Gly Leu
420 425 430 



Val Asp Gly Glu Val Pro Glu Leu Pro Pro Pro Leu Trp Ala Pro Pro 
435 440 445 



Gly Lys His Leu Val Thr Glu Val 
450 455 



<210> 176 

<211> 28 

<212> PRT 

<213> Homo sapien 

<400> 176 

Met Ser Tyr Asn Ser Lys Leu Glu Ser Ile Arg Leu Lys Arg Val Ser
1 5 10 15

Met Lys Thr Ile Pro Lys Ile Pro Phe Thr Gln Asn
20 25 



<210> 177 

<211> 91 

<212> PRT 

<213> Homo sapien 



<400> 177

Met Ala Leu Gly Ser Met Tyr Leu Val Leu Thr Leu Ile Val Ala Glu
1 5 10 15

Val Leu Arg Gly Ala Glu Pro Cys Cys Gly Pro Leu Lys Tyr Arg Val
20 25 30

Leu Arg Pro Cys Pro Leu Pro Val His Cys Ala Pro Pro His His Gln
35 40 45

Pro Ser Arg Gly Asn Pro Val Ala Cys Leu Pro Thr Tyr Lys Val Val
50 55 60

Tyr Gln Ala Ala Val Leu Ala Thr Ala Phe Lys Phe Gln Cys Asp Leu
65 70 75 80

Pro Gly Arg Ser Ile Thr Leu Arg Arg Ser Ala
85 90 



<210> 178 

<211> 54 

<212> PRT 

<213> Homo sapien 

<400> 178 

Met Lys Phe Ser Ser Ala Phe Val Gln Ser Lys Pro Leu Ser Ser Cys
1 5 10 15

Arg Ala Glu Thr Leu Tyr Met Lys Thr Val Ser Glu Leu Val Leu Ala
20 25 30

Ser Ile His Glu Asn Cys Leu Ser Cys Met Leu Ala Lys Thr Ser Ser
35 40 45 



Glu Thr Lys Lys Leu Lys 
50 



<210> 179 

<211> 88 

<212> PRT 

<213> Homo sapien 

<400> 179 

Gly Arg Val Arg Phe Val Val Glu Leu Ala Asp Pro Lys Leu Glu Val
1 5 10 15

Lys Trp Tyr Lys Asn Gly Gln Glu Ile Arg Pro Ser Thr Lys Tyr Ile
20 25 30

Phe Glu His Lys Gly Cys Gln Arg Ile Leu Phe Ile Asn Asn Cys Gln
35 40 45 



Met Thr Asp Asp Ser Glu Tyr Tyr Val Thr Ala Gly Asp Ala Lys Cys 
50 55 60 



Ser Thr Glu Leu Phe Val Arg Glu Pro Pro Phe Met Val Pro Ser Ser 
65 70 75 80 



Trp Ile Glu Thr Pro Ala Asp Cys
85 



<210> 180 

<211> 26 

<212> PRT 

<213> Homo sapien 

<400> 180 

Met Val Leu Tyr Ser Glu Gly His Gln His Gly Pro His Leu Leu Asn
1 5 10 15

Met Glu Asn Gln Asn Leu Asn Glu Tyr Asn
20 25 



<210> 181 

<211> 57 

<212> PRT 

<213> Homo sapien 

<400> 181 

Met Thr Glu Arg Ala Asp Gly Lys Ser Gln Ser Cys Ile Glu Glu Ile
1 5 10 15

Ser Met Val Ala Leu Lys Leu Leu Lys Pro Asp Val Ser Ser Ala Ser
20 25 30

His Trp Lys Met Asp Arg Trp Ala Asn His His Leu Thr Ser Gln Arg
35 40 45

Glu Gly Gln Cys Ala Lys Val Phe Lys
50 55 



<210> 182 
<211> 67 
<212> PRT

<213> Homo sapien 

<400> 182 

Met Asn Thr Lys Ala Leu Pro Thr Thr Pro Ala Gln Thr Ala Ile Ser
1 5 10 15

Pro Pro Glu Gly Gln Cys Ser Ser Ser Ile Gly Leu Glu Thr Ile Pro
20 25 30 



Glu Ser Pro Cys Phe Arg Thr Pro Glu Ser Ser Asn Ser Pro Ser Leu 
35 40 45 



Arg Arg Asp Leu Leu Ala Ala Lys Arg Val Lys Leu Ile Val Leu Gln
50 55 60 



Ser Ser Ala 
65 



<210> 183 

<211> 91 

<212> PRT 

<213> Homo sapien 

<400> 183 

Asp Val Gly Gly Ala Gln Val Leu Ala Thr Gly Lys Thr Pro Gly Ala
1 5 10 15

Glu Ile Asp Phe Lys Tyr Ala Leu Ile Gly Thr Ala Val Gly Val Ala
20 25 30

Ile Ser Ala Gly Phe Leu Ala Leu Lys Ile Cys Met Ile Arg Arg His
35 40 45

Leu Phe Asp Asp Asp Ser Ser Asp Leu Lys Ser Thr Pro Gly Gly Leu
50 55 60

Ser Asp Thr Ile Pro Leu Lys Lys Arg Ala Pro Arg Arg Asn His Asn
65 70 75 80

Phe Ser Lys Arg Asp Ala Gln Val Ile Glu Leu
85 90 



<210> 184 
<211> 101 
<212> PRT 
<213> Homo sapien
<400> 184 

Met Arg Pro Gly Arg Tyr Gln Ala Pro Arg Pro Phe Leu Tyr His Gly
1 5 10 15

Cys Trp Val Thr Ser Gly Ser His His Leu Phe Pro Ser Leu Phe Pro
20 25 30

Ile Ser Gln Met Trp Gly His Gly Leu Asp Asp Gly Leu His Arg Ser
35 40 45

Phe His Leu Cys Glu Ser Lys Ser Gly Gln Ser Ala Arg Thr His Leu
50 55 60

Cys Pro Gly Ser Ala Pro Gln Asn Gln Pro Pro Ala Ser Leu Lys Gln
65 70 75 80 



Lys Pro His Leu Lys Gly Cys Ser Glu Glu Ser Thr Phe Ser Met Ser 
85 90 95 



Cys Cys Trp Lys Ile
100 



<210> 185 

<211> 489 

<212> PRT 

<213> Homo sapien 

<400> 185 

Gly Trp Thr Val Ile Gln Asn Arg Gln Asp Gly Ser Val Asp Phe Gly
1 5 10 15

Arg Lys Trp Asp Pro Tyr Lys Gln Gly Phe Gly Asn Val Ala Thr Asn
20 25 30

Thr Asp Gly Lys Asn Tyr Cys Gly Leu Pro Gly Asn Glu Gln Ala Cys
35 40 45

Lys Ile Lys Ser Phe Tyr Leu Lys Trp Asp Phe Phe Ala Leu Lys Asn
50 55 60

Ile His Cys Trp Lys Pro Val Leu Gly Ser Ala Glu Glu Phe Pro Asp
65 70 75 80

Lys Asn Val Glu Ala Lys Asp Lys Gly Arg Lys Ala Val Phe Ser Phe
85 90 95

Pro Lys Phe Tyr Phe Trp Ala Glu Ile Leu Phe Cys Phe Ser Phe Gly
100 105 110

Glu Tyr Trp Leu Gly Asn Asp Lys Ile Ser Gln Leu Thr Arg Met Gly
115 120 125

Pro Thr Glu Leu Leu Ile Glu Met Glu Asp Trp Lys Gly Asp Lys Val
130 135 140

Lys Ala His Tyr Gly Gly Phe Thr Val Gln Asn Glu Ala Asn Lys Tyr
145 150 155 160

Gln Ile Ser Val Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met
165 170 175

Asp Gly Ala Ser Gln Leu Met Gly Glu Asn Arg Thr Met Thr Ile His
180 185 190



Asn Gly Met Phe Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly Trp Tyr 
195 200 205 



Val Trp His Ser Leu Leu Leu Leu Ala Lys Ser His Ala Tyr His Tyr 
210 215 220 



Ser Glu Ser Leu Thr Ile Phe Leu Ile Ala Thr Thr Ser Trp Ala Leu
225 230 235 240

Thr Val Ser His Cys Pro Lys Leu Phe Met His His Ser Lys Ala Phe
245 250 255

Gln Leu Ala Gly Arg His Ser Tyr Ser His Phe Thr Asp Glu Ile Ala
260 265 270

Arg Asp Tyr Val Ile Cys Pro Met Ser His Asn Tyr Pro Glu Ile Lys
275 280 285 



Leu Glu Phe Glu His Ser Tyr Phe Leu Asn Asn Glu His Leu Asp Lys 
290 295 300 



Tyr Leu Tyr Leu Tyr Ile Leu Lys Cys Val Ala Lys Leu Ser Phe Ser
305 310 315 320

Phe Pro Gly Phe Ser Asp Thr Lys Gly Cys Lys Ser Tyr Tyr Ser Ser
325 330 335

Ile Lys Ala Gln Thr Gln Ser Leu Asp Gly Leu Pro Gln Arg Pro Ser
340 345 350

Tyr Leu Ser Phe Leu Leu Ala Gly Thr Gly Gly Leu Trp Cys Ile Ser
355 360 365

Val Thr Leu Cys Ile Ala Pro Lys Gly Lys Thr Thr Val His Thr Ser
370 375 380

Val Ala Val Phe Tyr Gly Ala Ser Ala Lys Arg Asn Leu Thr Thr Val
385 390 395 400

Val Leu Phe Leu Ile Thr Pro Asn Thr Phe Ser Phe Arg Leu Thr Ser
405 410 415

Asp Pro Arg Lys Gln Cys Ser Lys Glu Asp Gly Gly Gly Trp Trp Tyr
420 425 430 



Asn Arg Cys His Ala Ala Asn Pro Asn Gly Arg Tyr Tyr Trp Gly Gly 
435 440 445 



Gln Tyr Thr Trp Asp Met Ala Lys His Gly Thr Asp Asp Gly Val Val
450 455 460

Trp Met Asn Trp Lys Gly Ser Trp Tyr Ser Met Arg Lys Met Ser Met
465 470 475 480

Lys Ile Arg Pro Phe Phe Pro Gln Gln
485 



<210> 186 

<211> 33 

<212> PRT 

<213> Homo sapien 

<400> 186 

Met Val Thr Glu Ser Leu Ser Ser Pro His Ser Glu Ser Ile Pro Leu
1 5 10 15

Gly Arg Val Asn Pro Gly Ser Gly Leu Pro Pro His Ser Thr Arg Pro
20 25 30

Phe



<210> 187 

<211> 149 

<212> PRT 

<213> Homo sapien 

<400> 187 

Pro Gly Asn Leu Asp Thr Ser Ser Arg Gly Ser Ser Gly Ser Pro Ala
1 5 10 15

His Ala Glu Ser Tyr Ser Ser Gly Gly Gly Gly Gln Gln Lys Phe Arg
20 25 30

Val Asp Met Pro Gly Ser Gly Ser Ala Phe Ile Pro Thr Ile Asn Ala
35 40 45

Ile Thr Thr Ser Gln Asp Leu Gln Trp Met Val Gln Pro Thr Val Ile
50 55 60 



Thr Ser Met Ser Asn Pro Tyr Pro Arg Ser His Pro Tyr Ser Pro Leu 
65 70 75 80 



Pro Gly Leu Ala Ser Val Ala Gly His Met Ala Leu Pro Arg Pro Gly 
85 90 95 



Val Ile Lys Thr Ile Gly Thr Thr Val Gly Arg Arg Arg Arg Asp Glu
100 105 110

Gln Leu Ser Pro Glu Glu Glu Glu Lys Arg Arg Ile Arg Arg Glu Arg
115 120 125 



Asn Lys Leu Ala Ala Ala Lys Cys Arg Asn Arg Arg Arg Glu Leu Thr 
130 135 140 



Glu Lys Leu Gln Ala
145 



<210> 188 

<211> 41 

<212> PRT 

<213> Homo sapien 



<400> 188 

Met Thr Val Pro Leu His Thr Ser Leu Ser Tyr Arg Gly Arg Ser Gln
1 5 10 15

Leu Leu Lys Thr Lys Thr Thr Ile Asn Ile Tyr Lys Asn His Asn Ile
20 25 30 



Lys Gly Phe Met Leu Arg Lys Asn Pro 
35 40 



<210> 189 

<211> 45 

<212> PRT 

<213> Homo sapien 

<400> 189 

Met Tyr Thr Asn Lys Tyr Ala Gln Asp Leu Glu Ser Tyr Ile Lys Met
1 5 10 15

Tyr Leu Thr Trp Leu Glu Cys Val Cys Val Phe Pro Arg Leu Ser Lys
20 25 30

Ile Arg Lys Pro Glu Ser Gln Ala Thr Lys Lys Lys Asn
35 40 45 



<210> 190 

<211> 91 

<212> PRT 

<213> Homo sapien 

<400> 190 

Met Phe Leu Cys Asn Val Leu Arg Val Thr Trp Ala Ser Pro Thr Tyr 
1 5 10 15

Ala Ser Thr Val Cys Cys Val Thr Phe Arg Gln Leu His Thr Pro Pro
20 25 30

Ala Pro Leu Pro Ser Pro Pro Ser Ser His Thr Val Ser Ala Gly Cys
35 40 45

Gly Ser Pro Thr Ser Val Met Ser Gly Ile Met Leu Leu Leu Ser Leu
50 55 60

Leu Phe Ser Leu Phe Phe Phe Phe Val Ile Gln Val Leu Leu Thr Ser
65 70 75 80

Ser Leu Ile His Gln Asn Ala Arg Ser Ser Tyr
85 90



<210> 191 

<211> 100 

<212> PRT 

<213> Homo sapien 

<400> 191 

Ala Asp Asn Asp Ile Gly Ala Val Ser Thr Thr Gly His Gly Glu Ser
1 5 10 15

Ile Leu Lys Val Asn Leu Ala Arg Leu Thr Leu Phe His Ile Glu Gln
20 25 30

Gly Lys Thr Val Glu Glu Ala Ala Asp Leu Ser Leu Gly Tyr Met Lys
35 40 45

Ser Arg Val Lys Gly Leu Gly Gly Leu Ile Val Val Ser Lys Thr Gly
50 55 60

Asp Trp Val Ala Lys Trp Thr Ser Thr Ser Met Pro Trp Ala Ala Ala
65 70 75 80

Lys Asp Gly Lys Leu His Phe Gly Ile Asp Pro Asp Asp Thr Thr Ile
85 90 95 



Thr Asp Leu Pro 
100 



<210> 192 

<211> 54 

<212> PRT 

<213> Homo sapien 

<400> 192 

Met Glu Glu Gln Glu Glu Ala Leu Cys Ser His His Ile Pro Val Ala
1 5 10 15

Arg Ser Trp Leu Gln Gly Ser Ser Gly Asn Arg Ile Pro Arg Ser His
20 25 30

Glu Thr Ser Pro Asn Ser Ala Val Thr Glu Ser Thr Arg Gln Trp Leu
35 40 45 



Lys Asp Gly Glu Thr Ser 
50 

<210> 193

<211> 63 

<212> PRT 

<213> Homo sapien 

<400> 193 

Met Ile Ile Leu Lys Tyr Arg Trp Lys Asp Thr Asn Ala Arg Lys Arg
1 5 10 15

Glu Ser Asn Gln Pro Arg Phe Gly Gly Trp Gly Thr Glu Asp Gly Ala
20 25 30

Thr Phe Pro Pro Tyr Leu Leu Phe Phe Tyr Ile Pro Ile Cys Thr Leu
35 40 45

Arg Ile His Leu Arg Ser Ser Phe Lys Arg Glu Lys Leu Asp Thr
50 55 60 



<210> 194 

<211> 211 

<212> PRT 

<213> Homo sapien 

<400> 194 

Met Val Phe Leu Lys Phe Phe Cys Met Ser Phe Phe Cys His Leu Cys 
1 5 10 15

Gln Gly Tyr Phe Asp Gly Pro Leu Tyr Pro Glu Met Ser Asn Gly Thr
20 25 30 



Leu His His Tyr Phe Val Pro Asp Gly Asp Tyr Glu Glu Asn Asp Asp 
35 40 45 



Pro Glu Lys Cys Gln Leu Leu Phe Arg Val Ser Asp His Arg Arg Cys
50 55 60

Ser Gln Gly Glu Gly Ser Gln Val Gly Ser Leu Leu Ser Leu Thr Leu
65 70 75 80

Arg Glu Glu Phe Thr Val Leu Gly Arg Gln Val Glu Asp Ala Gly Arg
85 90 95

Val Leu Glu Gly Ile Ser Lys Ser Ile Ser Tyr Asp Leu Asp Gly Glu
100 105 110

Glu Ser Tyr Gly Lys Tyr Leu Arg Arg Glu Ser His Gln Ile Gly Asp
115 120 125 



Ala Tyr Ser Asn Ser Asp Lys Ser Leu Thr Glu Leu Glu Ser Lys Phe 
130 135 140 



Lys Gln Gly Gln Glu Gln Asp Ser Arg Gln Glu Ser Arg Leu Asn Glu
145 150 155 160 



Asp Phe Leu Gly Met Leu Val His Thr Arg Ser Leu Leu Lys Glu Thr 
165 170 175 



Leu Asp Ile Ser Val Gly Leu Arg Asp Lys Tyr Glu Leu Leu Ala Leu
180 185 190

Thr Ile Arg Ser His Gly Thr Arg Leu Gly Arg Leu Lys Asn Asp Tyr
195 200 205 



Leu Lys Val 
210 



<210> 195 

<211> 54 

<212> PRT 

<213> Homo sapien 

<400> 195 

Met Asp Asp Ser Lys Leu Gln Lys Lys Lys Asp Val Asp Lys His Cys
1 5 10 15

Leu Thr Glu His Phe Ile Phe Ser Gln Leu Phe Trp Phe Leu Leu Ile
20 25 30 



Thr Met Thr Lys Met Leu Asp Ser Glu Leu Cys Arg Tyr Phe Ser Lys 
35 40 45 



Phe Tyr Asp Phe Lys Ser 
50 



<210> 196 

<211> 88 

<212> PRT 

<213> Homo sapien 



<400> 196 

Met Leu Gly Leu Gln Thr Leu Ser Arg Phe Leu Ser Gly His Pro Gly 
1 5 10 15 



Phe Leu Thr His Cys Leu Lys Ser Arg Trp Gln Val Pro Ser Leu Asn 
20 25 30 



His Ser Cys Ala Pro Glu Asp Ser Gly Pro Lys Leu Pro Ser Ser Ala 
35 40 45 



Cys His Ser Leu Leu Ile Ile Ser Ser Ser Asp Gln Val Cys Val Met 
50 55 60 



His Leu Ala Gln Ala Gln Gly Val Pro Arg Arg Asp His Asp Pro Ser 
65 70 75 80 



His Cys Ala Arg Ser Ser Ser Ile 
85 



<210> 197 

<211> 48 

<212> PRT 

<213> Homo sapien 

<400> 197 

Met Thr Glu Met Thr Gln Ser Lys Gly Arg Ile Gly Thr Glu Asp Ala 
1 5 10 15 



Asn Thr Gly Ser Tyr Lys Ile Gln Arg Glu Leu Ser Gly Gly Lys Thr 
20 25 30 



Gln Glu Pro Asn Ser Thr His Leu Ile Pro Leu Val Asp Gln Leu Asn 
35 40 45 



<210> 198 

<211> 121 

<212> PRT 

<213> Homo sapien 

<400> 198 



Phe Phe Ala Asp Glu Val Ser Arg Leu Ser Pro Gly Leu Glu Cys Ser 
1 5 10 15 



Gly Val Ile Ser Ala His Cys Asn Phe His Leu Leu Gly Ser Ser Ser 
20 25 30 



Ser Pro Ala Ser Ala Ser Gln Val Ala Glu Ile Thr Gly Ala Cys His 
35 40 45 



Pro Thr Trp Leu Ile Phe Val Ile Leu Val Glu Thr Gly Phe His His 
50 55 60 



Val Gly Gln Ala Asp Ala Leu Leu Thr Ser Gly Asp Pro Pro Phe Ser 
65 70 75 80 



Ala Ser Gln Ser Ala Gly Ile Thr Gly Val Ser His Arg Ala Arg Pro 
85 90 95 



Ala Asn Thr Phe Ala Leu Thr Thr Leu Gly Leu Leu Tyr Lys Ile Val 
100 105 110 



Met Ile Ala Met Glu Val Leu Pro Pro 
115 120 



<210> 199 

<211> 162 

<212> PRT 

<213> Homo sapien 

<400> 199 

Met Asp Ala Ala Gly Gln Val Leu Gly Pro Glu Arg Gly Gly Tyr Leu 
1 5 10 15 



Pro His Trp Val Ala Ser Ser Ala Ala Pro His Leu Ser Leu Phe Ser 
20 25 30 



Pro Lys Leu Val Phe Leu Thr Ile Xle Val Val Gly Gly Gly Gln Met 
35 40 45 



Leu Lys Val Glu Ala Asp Leu Glu Lys Glu Thr His Gly Val Thr Val 
50 55 60 



Ala Lys Asp Ser Trp Lys Arg Asn Ser Ile Thr Ser Ser Leu Ala Thr 
65 70 75 80 



Thr Arg His Pro Arg Pro Trp His Ser Gln Arg Leu Cys Ala Val Ala 
85 90 95 



Lys Pro Leu Asn Leu Phe Trp Pro Cys Val Leu Gln Arg Ser Leu Cys 
100 105 110 



Cys Lys Thr Val Asp Ser Phe Asp Glu Val Leu Lys Asn Ala Thr Arg 
115 120 125 



Gly Gly Gly Val Trp Leu Ala Val Trp Pro Ser Ser Glu Lys Val Ala 
130 135 140 



Glu Ile Arg Gly Gln Gly Cys His Ser Pro Arg Leu Ser Ser Gly Ser 
145 150 155 160 



Gln Ser 



<210> 200 

<211> 594 

<212> PRT 

<213> Homo sapien 

<400> 200 

Val Pro Gly Arg Lys Leu His Arg Ser Arg Leu Gln Ala Ala Ala Pro 
1 5 10 15 



Arg Pro Ser Thr Cys Ala Gln Ser Leu Cys Trp Ser Arg Pro Pro Ala 
20 25 30 



Ala Gly Thr Gly Thr Gly Asp Pro Ser Gln Ser Lys Ala Pro Thr Met 
35 40 45 



Ala Met Gly Leu Phe Arg Val Cys Leu Val Val Val Thr Ala Ile Ile 
50 55 60 



Asn His Pro Leu Leu Phe Pro Arg Glu Asn Ala Thr Val Pro Glu Asn 
65 70 75 80 



Glu Glu Glu Ile Ile Arg Lys Met Gln Ala His Gln Glu Lys Leu Gln 
85 90 95 



Leu Glu Gln Leu Arg Leu Glu Glu Glu Val Ala Arg Leu Ala Ala Glu 
100 105 110 



Lys Glu Ala Leu Glu Gln Val Ala Glu Glu Gly Arg Gln Gln Asn Glu 
115 120 125 



Thr Arg Val Ala Trp Asp Leu Trp Ser Thr Leu Cys Met Ile Leu Phe 
130 135 140 



Leu Met Ile Glu Val Trp Arg Gln Asp His Gln Glu Gly Pro Ser Pro 
145 150 155 160 



Glu Cys Leu Gly Gly Glu Glu Asp Glu Leu Pro Gly Leu Gly Gly Ala 
165 170 175 



Pro Leu Gln Gly Leu Thr Leu Pro Asn Lys Ala Thr Leu Gly His Phe 
180 185 190 



Tyr Glu Arg Cys Ile Arg Gly Ala Thr Ala Asp Ala Ala Arg Thr Arg 
195 200 205 



Glu Phe Leu Glu Gly Phe Val Asp Asp Leu Leu Glu Ala Leu Arg Ser 
210 215 220 



Leu Cys Asn Arg Asp Thr Asp Met Glu Val Glu Asp Phe Ile Gly Val 
225 230 235 240 



Asp Ser Met Tyr Glu Asn Trp Gln Val Asp Arg Pro Leu Leu Cys His 
245 250 255 



Leu Phe Val Pro Phe Thr Pro Pro Glu Pro Tyr Arg Phe His Pro Glu 
260 265 270 



Leu Trp Cys Ser Gly Arg Ser Val Pro Leu Asp Arg Gln Gly Tyr Gly 
275 280 285 



Gln Ile Lys Val Val Arg Ala Asp Gly Asp Thr Leu Ser Cys Ile Cys 
290 295 300 



Gly Lys Thr Lys Leu Gly Glu Asp Met Leu Cys Leu Leu His Gly Arg 
305 310 315 320 



Asn Ser Met Ala Pro Pro Cys Gly Asp Met Glu Asn Leu Leu Cys Ala 
325 330 335 



Thr Asp Ser Leu Tyr Leu Asp Thr Met Gln Val Met Lys Trp Phe Gln 
340 345 350 



Thr Ala Leu Thr Arg Ala Trp Lys Gly Ile Ala His Lys Tyr Glu Phe 
355 360 365 



Asp Leu Ala Phe Gly Gln Leu Asp Ser Pro Gly Ser Leu Lys Ile Lys 
370 375 380 



Phe Arg Ser Gly Lys Phe Met Pro Phe Asn Leu Ile Pro Val Ile Gln 
385 390 395 400 



Cys Asp Asp Ser Asp Leu Tyr Phe Val Ser His Leu Pro Arg Glu Pro 
405 410 415 



Ser Glu Gly Thr Pro Ala Ser Ser Thr Asp Trp Leu Leu Ser Phe Ala 
420 425 430 



Val Tyr Glu Arg His Phe Leu Arg Thr Thr Leu Lys Ala Leu Pro Glu 
435 440 445 



Gly Ala Cys His Leu Ser Cys Leu Gln Ile Ala Ser Phe Leu Leu Ser 
450 455 460 



Lys Gln Ser Arg Leu Thr Gly Pro Ser Gly Leu Ser Ser Tyr His Leu 
465 470 475 480 



Lys Thr Ala Leu Leu His Leu Leu Leu Leu Arg Gln Ala Ala Asp Trp 
485 490 495 



Lys Ala Gly Gln Leu Asp Ala Arg Leu His Glu Leu Leu Cys Phe Leu 
500 505 510 



Glu Lys Ser Leu Leu Gln Lys Lys Leu His His Phe Phe Ile Gly Asn 
515 520 525 



Arg Lys Val Pro Glu Ala Met Gly Leu Pro Glu Ala Val Leu Arg Ala 
530 535 540 



Glu Pro Leu Asn Leu Phe Arg Pro Phe Val Leu Gln Arg Ser Leu Tyr 
545 550 555 560 



Arg Lys Thr Leu Asp Ser Phe Tyr Glu Met Leu Lys Asn Ala Pro Ala 
565 570 575 



Leu Ile Ser Glu Tyr Ser Leu His Val Pro Ser Asp Gln Pro Thr Pro 
580 585 590 



Lys Ser 



<210> 201 
<211> 38 
<212> PRT 
<213> Homo sapien 
<400> 201 

Met Ser Leu His Ala Glu Val Gly Gly Ala Leu Lys Pro Val Ile Tyr 
1 5 10 15 



Ala Val Lys Thr Lys Trp Val Cys Tyr Leu Ile Ser Trp Gly Ile His 
20 25 30 



Gly Leu Ala Val Pro Gly 
35 



<210> 202 

<211> 16 

<212> PRT 

<213> Homo sapien 

<400> 202 



Met Glu Arg Ile Gly Thr Phe Tyr Ser Gly Asn Thr Gln Pro Ala Thr 
1 5 10 15 



<210> 203 

<211> 87 

<212> PRT 

<213> Homo sapien 

<400> 203 

Met Ala Glu Gly Val 
1 5 10 15 



Ser Leu Pro Ser Ala Ser Pro Val Pro Pro Ala Ala Leu Val Thr Val 
20 25 30 



Ser Asp Gly Tyr Leu Pro Gly Phe Val Ala Ser Leu Ser Val Phe Ser 
35 40 45 



Cys Ser Asp Pro Leu Ala Gly Trp Leu Arg Lys Lys Lys Met Cys Phe 
50 55 60 



Arg Cys His Cys Asn Pro Gly His Gln Gly Asn Pro Ser Phe Pro Phe 
65 70 75 80 



Leu Ile Cys Ser Pro Arg Thr 
85 



<210> 204 
<211> 252 
<212> PRT 
<213> Homo sapien 

<400> 204 

Met Ser Ile Tyr Lys Glu Pro Pro Pro Gly Met Phe Val Val Pro Asp 
1 5 10 15 



Thr Val Asp Met Thr Lys Ile His Ala Leu Ile Thr Gly Pro Phe Asp 
20 25 30 



Thr Pro Tyr Glu Gly Gly Phe Phe Leu Phe Val Phe Arg Cys Pro Pro 
35 40 45 



Asp Tyr Pro Ile His Pro Pro Arg Val Lys Leu Met Thr Thr Gly Asn 
50 55 60 



Asn Thr Val Arg Phe Asn Pro Asn Phe Tyr Arg Asn Gly Lys Val Cys 
65 70 75 80 



Leu Ser Ile Leu Gly Thr Trp Thr Gly Pro Ala Trp Ser Pro Ala Gln 
85 90 95 



Ser Ile Ser Ser Val Leu Ile Ser Ile Gln Ser Leu Met Thr Glu Asn 
100 105 110 



Pro Tyr His Asn Glu Pro Gly Phe Glu Gln Glu Arg His Pro Gly Asp 
115 120 125 



Ser Lys Asn Tyr Asn Glu Cys Ile Arg His Glu Thr Ile Arg Val Ala 
130 135 140 



Val Cys Asp Met Met Glu Gly Lys Cys Pro Cys Pro Glu Pro Leu Arg 
145 150 155 160 



Gly Val Met Glu Lys Ser Phe Leu Glu Tyr Tyr Asp Phe Tyr Glu Val 
165 170 175 



Ala Cys Lys Asp Arg Leu His Leu Gln Gly Gln Thr Met Gln Asp Pro 
180 185 190 



Phe Gly Glu Lys Arg Gly His Phe Asp Tyr Gln Ser Leu Leu Met Arg 
195 200 205 



Leu Gly Leu Ile Arg Gln Lys Val Leu Glu Arg Leu His Asn Glu Asn 
210 215 220 



Ala Glu Met Asp Ser Asp Ser Ser Ser Ser Gly Thr Glu Thr Asp Leu 
225 230 235 240 



His Gly Ser Leu Arg Val His Gly Ser Leu Arg Val 
245 250 



<210> 205 

<211> 91 

<212> PRT 

<213> Homo sapien 

<400> 205 

Met Leu Thr Pro Ala Arg Pro Ser Cys His Thr Leu Ser Gly Arg Ser 
1 5 10 15 



Met Ala Tyr Arg Met Lys Arg Gly Thr Arg Asn Pro Cys Gly Arg Gly 
20 25 30 



Leu Asp Leu Lys Gln Cys Pro Leu Trp Leu Leu Leu Pro Trp Leu Thr 
35 40 45 



Gly Phe Leu Asp His Val His Phe Thr Gly Pro Trp Asp Leu His Leu 
50 55 60 



Leu Ala Ser Pro Ala Gly Leu Ile Pro Ala Arg Ala Pro Ser Phe Leu 
65 70 75 80 



Leu Met Val Phe Arg Trp Pro Asp His Gly Lys 
85 90 



<210> 206 

<211> 213 

<212> PRT 

<213> Homo sapien 

<400> 206 

Ser Pro His Gln Ala Ala Ala Pro Val Asp Gln Thr Pro Arg Thr Leu 
1 5 10 15 



Ala Thr Met Gly Gln Arg Ala Leu Pro Ser Ser Leu Ala Leu Leu Ser 
20 25 30 



Arg Pro Leu Ser Pro Pro Pro Ala Ala Cys Ser Gly Asp Pro Gly Cys 
35 40 45 



Gly Ser Gly Ala Gly Leu Pro Ser Ala Ser Ala Ala Ala Gly Ile Ala 
50 55 60 



Ser Ser Ala Val Glu Ala Val Cys Gly Asp Ala Ala Pro Ala Cys Leu 
65 70 75 80 



Leu Arg Thr Pro Leu Arg Gly Leu Leu Lys Pro Thr Gly Pro Arg Ser 
85 90 95 



Thr Met Glu Cys Pro Pro Ala Leu Ile Val His Pro Pro Thr Gly Gly 
100 105 110 



Met Ala Arg Arg Ala Ala Ser Gln Pro Trp Ala Ala Ala Ser Ala Thr 
115 120 125 



Pro Met Leu Ser Ser Lys Ala Ser Leu Cys Ile Pro Thr Glu Arg Pro 
130 135 140 



Pro Pro Gln Pro Leu Met Arg Thr Pro Ala Ala Arg Ser His Trp Pro 
145 150 155 160 



Ile Pro His Pro Ala Ser Thr Ala Cys Pro Ala Pro Leu Pro Val Val 
165 170 175 



Leu Val Ala Pro Arg Ser Thr Ile Leu Ser Met Ser Arg Thr Trp Thr 
180 185 190 



Cys Arg Arg Trp Ala Val Ala Pro Cys Arg Ala Glu Lys Leu Met Cys 
195 200 205 



Ser Ser Ser Arg Ser 
210 



<210> 207 

<211> 92 

<212> PRT 

<213> Homo sapien 

<400> 207 

Met Tyr Lys Gly Ala Ala Trp Arg Gly Lys Glu His Asn Lys Thr Pro 
1 5 10 15 



Leu Glu Val Phe Gln Arg Val Val Ser Gln Ile Ser Leu Ile Gln Glu 
20 25 30 



Glu Asp Asp Glu Arg Glu Arg Thr Trp Asn Tyr Leu Lys Ser Ser Asn 
35 40 45 



Ser Leu Val Leu Phe Asn Lys Lys Glu Phe Trp Phe Val Ala Glu Ser 
50 55 60 



Asp Leu Thr Ala Ala Asn Ser Ser Leu Leu Leu Arg Cys Ile Ser Asn 
65 70 75 80 



Ser Lys Leu Asp Ala Pro Pro Ser Leu Phe Phe Pro 
85 90 



<210> 208 

<211> 130 

<212> PRT 

<213> Homo sapien 

<400> 208 

Met Val Cys Glu Asp Ala Pro Ser Phe Gln Met Ala Trp Glu Ser Gln 
1 5 10 15 



Met Ala Trp Glu Arg Gly Pro Ala Leu Leu Cys Cys Val Leu Ser Ala 
20 25 30 



Ser Gln Leu Ser Ser Gln Asp Gln Asp Pro Leu Gly His Ile Lys Ser 
35 40 45 



Leu Leu Tyr Pro Phe Gly Phe Pro Val Glu Leu Pro Arg Pro Gly Pro 
50 55 60 



Thr Gly Ala Tyr Lys Lys Val Lys Asn Gln Asn Gln Thr Thr Ser Ser 
65 70 75 80 



Glu Leu Leu Arg Lys Gln Thr Ser His Phe Asn Gln Arg Gly His Arg 
85 90 95 



Ala Arg Ser Lys Leu Leu Ala Ser Arg Gln Ile Pro Asp Arg Thr Phe 
100 105 110 



Lys Cys Gly Lys Trp Leu Pro Gln Val Pro Ser Pro Val Val Pro Ser 
115 120 125 



Pro Val 
130 



<210> 209 

<211> 63 

<212> PRT 

<213> Homo sapien 

<400> 209 

Met Asn Asp Tyr Gly Leu Gly Leu Gly Phe Ile Thr Asn Pro Ile Ile 
1 5 10 15 



Asp His Leu Phe Pro Ala Leu Gly Ile Thr Ala Lys Pro Asn Gly Ser 
20 25 30 



Phe Ser Ile Thr Ala Ser Tyr Asn Phe His Ile Phe Leu Leu Phe Leu 
35 40 45 



Thr Gly Leu Gln Val Leu Ser Asn Val Leu Lys Leu Phe Asn Val 
50 55 60 



<210> 210 

<211> 451 

<212> PRT 

<213> Homo sapien 

<400> 210 

Ala Thr Lys Thr Lys Ala Pro Asp Asp Leu Val Ala Pro Val Val Lys 
1 5 10 15 



Lys Pro His Ile Tyr Tyr Gly Ser Leu Glu Glu Lys Glu Arg Glu Arg 
20 25 30 



Leu Ala Lys Gly Glu Ser Gly Ile Leu Gly Lys Asp Gly Leu Lys Ala 
35 40 45 



Gly Ile Glu Ala Gly Asn Ile Asn Ile Thr Ser Gly Glu Val Phe Glu 
50 55 60 



Ile Glu Glu His Ile Ser Glu Arg Gln Ala Glu Val Leu Ala Glu Phe 
65 70 75 80 



Glu Arg Arg Lys Arg Ala Arg Gln Ile Asn Val Ser Thr Asp Asp Ser 
85 90 95 



Glu Val Lys Ala Cys Leu Arg Ala Leu Gly Glu Pro Ile Thr Leu Phe 
100 105 110 



Gly Glu Gly Pro Ala Glu Arg Arg Glu Arg Leu Arg Asn Ile Leu Ser 
115 120 125 



Val Val Gly Thr Asp Ala Leu Lys Lys Thr Lys Lys Asp Asp Glu Lys 
130 135 140 



Ser Lys Lys Ser Lys Glu Glu Tyr Gln Gln Thr Trp Tyr His Glu Gly 
145 150 155 160 



Pro Asn Ser Leu Lys Val Ala Arg Leu Trp Ile Ala Asn Tyr Ser Leu 
165 170 175 



Pro Arg Ala Met Lys Arg Leu Glu Glu Ala Arg Leu His Lys Glu Ile 
180 185 190 



Pro Glu Thr Thr Arg Thr Ser Gln Met Gln Glu Leu His Lys Ser Leu 
195 200 205 



Arg Ser Leu Asn Asn Phe Cys Ser Gln Ile Gly Asp Asp Arg Pro Ile 
210 215 220 



Ser Tyr Cys His Phe Ser Pro Asn Ser Lys Met Leu Ala Thr Ala Cys 
225 230 235 240 



Cys Asp Glu Pro Val Ala Asp Ile Glu Gly His Thr Val Arg Val Ala 
245 250 255 



Arg Val Met Trp His Pro Ser Gly Arg Phe Leu Gly Thr Thr Cys Tyr 
260 265 270 



Asp Arg Ser Trp Arg Leu Trp Asp Leu Glu Ala Gln Glu Glu Ile Leu 
275 280 285 



His Gln Glu Gly His Ser Met Gly Val Tyr Asp Ile Ala Phe His Gln 
290 295 300 



Asp Gly Ser Leu Ala Gly Thr Gly Gly Leu Asp Ala Phe Gly Arg Val 
305 310 315 320 



Trp Asp Leu Arg Thr Gly Arg Cys Ile Met Phe Leu Glu Gly His Leu 
325 330 335 



Lys Glu Ile Tyr Gly Ile Asn Phe Ser Pro Asn Gly Tyr His Ile Ala 
340 345 350 



Thr Gly Ser Gly Asp Asn Thr Cys Lys Val Trp Asp Leu Arg Gln Arg 
355 360 365 



Arg Cys Val Tyr Thr Ile Pro Ala His Gln Asn Leu Val Thr Gly Val 
370 375 380 



Lys Phe Glu Pro Ile His Gly Asn Phe Leu Leu Thr Gly Ala Tyr Asp 
385 390 395 400 



Asn Thr Ala Lys Ile Trp Thr His Pro Gly Trp Ser Pro Leu Lys Thr 
405 410 415 



Leu Ala Gly His Glu Gly Lys Val Met Gly Leu Asp Ile Ser Ser Asp 
420 425 430 



Gly Gln Leu Ile Ala Thr Cys Ser Tyr Asp Arg Thr Phe Lys Leu Trp 
435 440 445 



Met Ala Glu 
450 



<210> 211 

<211> 34 

<212> PRT 

<213> Homo sapien 

<400> 211 

Met Glu Ala Gln Gly Cys His Asp Gly Ser Val Val Ile Arg Glu Gly 
1 5 10 15 



Ala Pro Phe Ile Leu Leu Pro Thr Pro Leu Leu Cys Pro Phe Leu Pro 
20 25 30 



Leu Ile 



<210> 212 

<211> 610 

<212> PRT 

<213> Homo sapien 

<400> 212 

Gly Lys Ala Phe Ile Thr Cys Arg Thr Leu Leu Asn His Lys Ser Ile 
1 5 10 15 



His Phe Gly Asp Lys Pro Tyr Lys Cys Asp Glu Cys Glu Lys Ser Phe 
20 25 30 



Asn Tyr Ser Ser Leu Leu Ile Gln His Lys Val Ile His Thr Gly Glu 
35 40 45 



Lys Pro Tyr Glu Cys Asp Glu Cys Gly Lys Ala Phe Arg Asn Ser Ser 
50 55 60 



Gly Leu Ile Val His Lys Arg Ile His Thr Gly Glu Lys Pro Tyr Lys 
65 70 75 80 



Cys Asp Val Cys Gly Lys Ala Phe Ser Tyr Ser Ser Gly Leu Ala Val 
85 90 95 



His Lys Ser Ile His Pro Gly Lys Lys Ala His Glu Cys Lys Glu Cys 
100 105 110 



Gly Lys Ser Phe Ser Tyr Asn Ser Leu Leu Leu Gln His Arg Thr Ile 
115 120 125 



His Thr Gly Glu Arg Pro Tyr Val Cys Asp Val Cys Gly Lys Thr Phe 
130 135 140 



Arg Asn Asn Ala Gly Leu Lys Val His Arg Arg Leu His Thr Gly Glu 
145 150 155 160 



Lys Pro Tyr Lys Cys Asp Val Cys Gly Lys Ala Tyr Ile Ser Arg Ser 
165 170 175 



Ser Leu Lys Asn His Lys Gly Ile His Leu Gly Glu Lys Pro Tyr Lys 
180 185 190 



Cys Ser Tyr Cys Glu Lys Ser Phe Asn Tyr Ser Ser Ala Leu Glu Gln 
195 200 205 



His Lys Arg Ile His Thr Arg Glu Lys Pro Phe Gly Cys Asp Glu Cys 
210 215 220 



Gly Lys Ala Phe Arg Asn Asn Ser Gly Leu Lys Val His Lys Arg Ile 
225 230 235 240 



His Thr Gly Glu Arg Pro Tyr Lys Cys Glu Glu Cys Gly Lys Ala Tyr 
245 250 255 



Ile Ser Leu Ser Ser Leu Ile Asn His Lys Ser Val His Pro Gly Glu 
260 265 270 



Lys Pro Phe Lys Cys Asp Glu Cys Glu Lys Ala Phe Ile Thr Tyr Arg 
275 280 285 



Thr Leu Thr Asn His Lys Lys Val His Leu Gly Glu Lys Pro Tyr Lys 
290 295 300 



Cys Asp Val Cys Glu Lys Ser Phe Asn Tyr Thr Ser Leu Leu Ser Gln 
305 310 315 320 



His Arg Arg Val His Thr Arg Glu Lys Pro Tyr Glu Cys Asp Arg Cys 
325 330 335 



Glu Lys Val Phe Arg Asn Asn Ser Ser Leu Lys Val His Lys Arg Ile 
340 345 350 



His Thr Gly Glu Arg Pro Tyr Glu Cys Asp Val Cys Gly Lys Ala Tyr 
355 360 365 



Ile Ser His Ser Ser Leu Ile Asn His Lys Ser Thr His Pro Gly Lys 
370 375 380 



Thr Pro His Thr Cys Asp Glu Cys Gly Lys Ala Phe Phe Ser Ser Arg 
385 390 395 400 



Thr Leu Ile Ser His Lys Arg Val His Leu Gly Glu Lys Pro Phe Lys 
405 410 415 



Cys Val Glu Cys Gly Lys Ser Phe Ser Tyr Ser Ser Leu Leu Ser Gln 
420 425 430 



His Lys Arg Ile His Thr Gly Glu Lys Pro Tyr Val Cys Asp Arg Cys 
435 440 445 



Gly Lys Ala Phe Arg Asn Ser Ser Gly Leu Thr Val His Lys Arg Ile 
450 455 460 



His Thr Gly Glu Lys Pro Tyr Glu Cys Asp Glu Cys Gly Lys Ala Tyr 
465 470 475 480 



Ile Ser His Ser Ser Leu Ile Asn His Lys Ser Val His Gln Gly Lys 
485 490 495 



Gln Pro Tyr Asn Cys Glu Cys Gly Lys Ser Phe Asn Tyr Arg Ser Val 
500 505 510 



Leu Asp Gln His Lys Arg Ile His Thr Gly Lys Lys Pro Tyr Arg Cys 
515 520 525 



Asn Glu Cys Ala His Ile Pro Asn Ala Thr Ala Asp Leu Met Lys Val 
530 535 540 



Asp His Glu Glu Glu Pro Gln Leu Ser Glu Pro Tyr Leu Ser Lys Gln 
545 550 555 560 



Lys Lys Leu Met Ala Lys Ile Leu Glu His Asp Asp Val Ser Tyr Leu 
565 570 575 



Lys Lys Ile Leu Gly Glu Leu Ala Met Val Leu Asp Gln Ile Glu Ala 
580 585 590 



Glu Leu Glu Lys Arg Lys Leu Glu Asn Glu Ala Leu Ser Gln Trp Lys 
595 600 605 



Glu Phe 
610 



<210> 213 

<211> 47 

<212> PRT 

<213> Homo sapien 

<400> 213 

Met Cys Ala Lys Trp Gly Glu Ile Gly Ala Gly Lys Pro Ile Pro His 
1 5 10 15 



Arg Gly Pro Ala Leu Ala Pro Gly Ser Pro His Ala Phe Phe Val Phe 
20 25 30 



Phe Phe Phe Phe Ala Ser Asp Gln Phe Thr Thr Val Ser Trp Thr 
35 40 45 



<210> 214 

<211> 25 

<212> PRT 

<213> Homo sapien 

<400> 214 

Met Glu Thr Pro Ser Leu Glu Gly Thr Pro Arg Lys Pro Cys His Gly 
1 5 10 15 



Leu Leu Ser Leu Ser Ser Leu Leu Leu 
20 25 



<210> 215 

<211> 29 

<212> PRT 

<213> Homo sapien 

<400> 215 

Met Ser Ser Tyr Gly Met Gln Gly Thr Val Gly Ser Arg Val Ser Ile 
1 5 10 15 



Leu Pro Thr Arg Ala Gln Gly Gln Ala Gly Glu Val Arg 
20 25 



<210> 216 

<211> 64 

<212> PRT 

<213> Homo sapien 

<400> 216 

Met Val Thr Leu Asp Leu Leu Glu Arg Ala Gln Cys Asp Gly Ser Trp 
1 5 10 15 



Ser Arg Arg Gly Thr Pro Leu Leu Phe Tyr Phe Phe Cys Lys Val Leu 
20 25 30 



Thr Leu Glu Gly Tyr Ser Ile Gln Ser Leu Asn Met Phe Phe Lys Arg 
35 40 45 



Asn Lys Glu Gln Ala Thr Ala Leu Leu Glu Ile Thr Asn Arg Phe Leu 
50 55 60 



<210> 217 

<211> 50 

<212> PRT 

<213> Homo sapien 

<400> 217 

Met Glu Pro His Ile Met Lys Phe Asn Ser His Val Lys Thr Phe Cys 
1 5 10 15 



Ile Val Gly Cys Gln Lys Tyr Phe Pro Asn Phe Arg Leu Thr Cys Arg 
20 25 30 



Val Gly Asp Gly Leu Pro Pro Tyr Asn Phe Lys Phe Val Ser Gln Ser 
35 40 45 



Leu Ala 
50 



<210> 218 

<211> 785 

<212> PRT 

<213> Homo sapien 

<400> 218 

Lys Ala Lys Ile Ser Trp Glu Ala Pro Val Glu Lys Lys Thr Glu Cys 
1 5 10 15 



Ile Gln Lys Gly Lys Asn Asn Gln Val Gly Ala Trp Thr Leu Leu Leu 
20 25 30 



Val Leu Pro Ser Pro Gln Asp Val Ser Ser His Ser Gly Pro Arg Ala 
35 40 45 



Leu Thr Asn Arg Thr Pro Phe Cys Pro Gln Thr Glu Cys Phe Asn Phe 
50 55 60 



Ile Arg Phe Leu Gln Pro Tyr Asn Ala Ser His Leu Tyr Val Cys Gly 
65 70 75 80 



Thr Tyr Ala Phe Gln Pro Lys Cys Thr Tyr Val Asn Met Leu Thr Phe 
85 90 95 



Thr Leu Glu His Gly Glu Phe Glu Asp Gly Lys Gly Lys Cys Pro Tyr 
100 105 110 



Asp Pro Ala Lys Gly His Ala Gly Leu Leu Val Asp Gly Glu Leu Tyr 
115 120 125 



Ser Ala Thr Leu Asn Asn Phe Leu Gly Thr Glu Pro Ile Ile Leu Arg 
130 135 140 



Asn Met Gly Pro His His Ser Met Lys Thr Glu Tyr Leu Ala Phe Trp 
145 150 155 160 



Leu Asn Glu Pro His Phe Val Gly Ser Ala Tyr Val Pro Glu Ser Val 
165 170 175 



Gly Ser Phe Thr Gly Asp Asp Asp Lys Val Tyr Phe Phe Phe Arg Glu 
180 185 190 



Arg Ala Val Glu Ser Asp Cys Tyr Ala Glu Gln Val Val Ala Arg Val 
195 200 205 



Ala Arg Val Cys Lys Gly Asp Met Gly Gly Ala Arg Thr Leu Gln Arg 
210 215 220 



Lys Trp Thr Thr Phe Leu Lys Ala Arg Leu Ala Cys Ser Ala Pro Asn 
225 230 235 240 



Trp Gln Leu Tyr Phe Asn Gln Leu Gln Ala Met His Thr Leu Gln Asp 
245 250 255 



Thr Ser Trp His Asn Thr Thr Phe Phe Gly Val Phe Gln Ala Gln Trp 
260 265 270 



Gly Asp Met Tyr Leu Ser Ala Ile Cys Glu Tyr Gln Leu Glu Glu Ile 
275 280 285 



Gln Arg Val Phe Glu Gly Pro Tyr Lys Glu Tyr His Glu Glu Ala Gln 
290 295 300 



Lys Trp Asp Arg Tyr Thr Asp Pro Val Pro Ser Pro Arg Pro Gly Ser 
305 310 315 320 



Cys Ile Asn Asn Trp His Arg Arg His Gly Tyr Thr Ser Ser Leu Glu 
325 330 335 



Leu Pro Asp Asn Ile Leu Asn Phe Val Lys Lys His Pro Leu Met Glu 
340 345 350 



Glu Gln Val Gly Pro Arg Trp Ser Arg Pro Leu Leu Val Lys Lys Gly 
355 360 365 



Thr Asn Phe Thr His Leu Val Ala Asp Arg Val Thr Gly Leu Asp Gly 
370 375 380 



Ala Thr Tyr Thr Val Leu Phe Ile Gly Thr Gly Asp Gly Trp Leu Leu 
385 390 395 400 



Lys Ala Val Ser Leu Gly Pro Trp Val His Leu Ile Glu Glu Leu Gln 
405 410 415 



Leu Phe Asp Gln Glu Pro Met Arg Ser Leu Val Leu Ser Gln Ser Lys 
420 425 430 



Val Lys Leu Leu Phe Ala Gly Ser Arg Ser Gln Leu Val Gln Leu Pro 
435 440 445 



Val Ala Asp Cys Met Lys Tyr Arg Ser Cys Ala Asp Cys Val Leu Ala 
450 455 460 



Arg Asp Pro Tyr Cys Ala Trp Ser Val Asn Thr Ser Arg Cys Val Ala 
465 470 475 480 



Val Gly Gly His Ser Gly Ser Leu Leu Ile Gln His Val Met Thr Ser 
485 490 495 



Asp Thr Ser Gly Ile Cys Asn Leu Arg Gly Ser Lys Lys Val Arg Pro 
500 505 510 



Thr Pro Lys Asn Ile Thr Val Val Ala Gly Thr Asp Leu Val Leu Pro 
515 520 525 



Cys His Leu Ser Ser Asn Leu Ala His Ala Arg Trp Thr Phe Gly Gly 
530 535 540 



Arg Asp Leu Pro Ala Glu Gln Pro Gly Ser Phe Leu Tyr Asp Ala Arg 
545 550 555 560 



Leu Gln Ala Leu Val Val Met Ala Ala Gln Pro Arg His Ala Gly Ala 
565 570 575 



Tyr His Cys Phe Ser Glu Glu Gln Gly Ala Arg Leu Ala Ala Glu Gly 
580 585 590 



Tyr Leu Val Ala Val Val Ala Gly Pro Ser Val Thr Leu Glu Ala Arg 
595 600 605 



Ala Pro Leu Glu Asn Leu Gly Leu Val Trp Leu Ala Val Val Ala Leu 
610 615 620 



Gly Ala Val Cys Leu Val Leu Leu Leu Leu Val Leu Ser Leu Arg Arg 
625 630 635 640 



Arg Leu Arg Glu Glu Leu Glu Lys Gly Ala Lys Ala Thr Glu Arg Thr 
645 650 655 



Leu Val Tyr Pro Leu Glu Leu Pro Lys Glu Pro Thr Ser Pro Pro Phe 
660 665 670 



Arg Pro Cys Pro Glu Pro Asp Glu Lys Leu Trp Asp Pro Val Gly Tyr 
675 680 685 



Tyr Tyr Ser Asp Gly Ser Leu Lys Ile Val Pro Gly His Ala Arg Cys 
690 695 700 



Gln Pro Gly Gly Gly Pro Pro Ser Pro Pro Pro Gly Ile Pro Gly Gln 
705 710 715 720 



Pro Leu Pro Ser Pro Thr Arg Leu His Leu Gly Gly Gly Arg Asn Ser 
725 730 735 



Asn Ala Asn Gly Tyr Val Arg Leu Gln Leu Gly Gly Glu Asp Arg Gly 
740 745 750 



Gly Leu Gly His Pro Leu Pro Glu Leu Ala Asp Glu Leu Arg Arg Lys 
755 760 765 



Leu Gln Gln Arg Gln Pro Leu Pro Asp Ser Asn Pro Glu Glu Ser Ser 
770 775 780 



Val 
785 



<210> 219 

<211> 66 

<212> PRT 

<213> Homo sapien 

<400> 219 

Met Lys Met Arg Ala Lys Ile Leu His Gln Asn Gly Asn Asp Pro Ile 
1 5 10 15 



Ser Pro Val Lys Ala Glu Trp Val Glu Trp Gly Leu Arg Val Trp Ile 
20 25 30 



Gln Cys Phe Glu Leu His Ser Ser Arg Glu Ala Val Gln Lys Gly Gly 
35 40 45 



Ile Leu Gly Asn Leu Arg Lys Ile Val Gly Glu Thr Ser Phe Leu Leu 
50 55 60 



Val Ser 
65 



<210> 220 

<211> 128 

<212> PRT 

<213> Homo sapien 

<400> 220 



Glu Val Glu Gly Arg Ser Ala Cys Met Ala Met Gly Leu Phe Phe Ile 
1 5 10 15 



Pro Phe Leu Asn Cys Thr Gln Gln Gln Trp Phe Leu Leu Gly Leu Leu 
20 25 30 



Lys Thr Ala Gly Ile Trp Glu Lys Glu His His Arg Leu Ser Gln His 
35 40 45 



Gly Asn Ile Asn Leu Ile Pro Glu Lys Gly Arg Ser Pro Gln Arg Tyr 
50 55 60 



Val Arg Phe Asn Ser Phe Ser Ser Gly Pro Gly Ser Ser Phe Ser Cys 
65 70 75 80 



Ser Gly Leu Asn Arg Asp Ala Leu Ile Ser Leu Gly Ile Leu Leu Leu 
85 90 95 



Val Leu Ser Leu Thr Ser Gly Ala Lys Ile Arg Arg Pro Glu Phe Gln 
100 105 110 



Ile Tyr Ser Val Thr Gln Ser Leu Leu Gln Ser Leu Arg Asp Val Val 
115 120 125 



<210> 221 

<211> 64 

<212> PRT 

<213> Homo sapien 

<400> 221 



Met Gly Ile Leu Glu Pro Gln Asp Val Arg Ala Gly Arg Asp Ala Ile 
1 5 10 15 



Pro Val Tyr Thr Arg Gly Asn Ser Ser Arg Leu Trp Glu Gly Arg Arg 
20 25 30 



Val Leu Val Thr Glu Arg Glu Leu Lys Leu Arg Ile Pro Glu Ser Arg 
35 40 45 



Ser Cys Leu Pro Ser Ala Ile Phe Leu Pro Ile Asn Leu Cys Tyr Val 
50 55 60 



<210> 222 

<211> 105 

<212> PRT 

<213> Homo sapien 

<400> 222 

Cys Lys Leu Phe Gly Arg Val Gly Asp Ala Val Ser Phe Cys His Pro 
1 5 10 15 



Gly Trp Ser Ala Val Ala Arg Ser Gln Leu Thr Ala Thr Ser Ala Leu 
20 25 30 



Gln Gly Ser Gly Asn Ser Ala Ser Val Ser Ala Val Ala Gly Ile Thr 
35 40 45 



Gly Met Arg His His Thr Arg Leu Ile Phe Val Phe Leu Val Glu Thr 
50 55 60 



Arg Phe His His Val Gly Gln Asp Gly Leu Glu Pro Leu Thr Ser Gly 
65 70 75 80 



Asp Leu Pro Ile Ser Ala Ser Gln Ser Ala Gly Ile Thr Ser Val Ser 
85 90 95 



His Arg Ala Arg Pro Ala Asn Phe Phe 
100 105 



<210> 223 

<211> 109 

<212> PRT 

<213> Homo sapien 

<400> 223 



Met Met Trp Leu Ser Val Gly Gly Gly Gly Arg Glu Trp Ser Glu Met 
1 5 10 15 



Leu Gly Val Val Trp Trp Trp Gly Gly Val Gly Val Trp Val Gly Val 
20 25 30 



Gly Val Cys Gly Cys Val Trp Trp Val Val Val Gly Val Trp Trp Trp 
35 40 45 



Arg Cys Val Gly Cys Gly Cys Val Val Trp Trp Gly Gly Val Val Gly 
50 55 60 



Val Gly Gly Cys Trp Gly Gly Cys Val Cys Val Val Gly Val Cys Val 
65 70 75 80 



Cys Val Gly Gly Gly Val Val Gly Arg Val Val Gly Gly Ala Gly Val 
85 90 95 



Cys Gly Gly Arg Cys Gly Cys Cys Val Val Trp Trp Cys 
100 105 



<210> 224 

<211> 196 

<212> PRT 

<213> Homo sapien 

<400> 224 

Thr Arg Pro Gln Ser His Thr Thr Thr Glu His Pro Pro Pro Pro Pro 
1 5 10 15 



Thr Thr Ile His Ile Thr Gln Thr Leu His Lys Lys Thr Asn Thr Thr 
20 25 30 



Asn Thr Gln Gln Lys Lys His Thr Asn Thr Gln Ile Thr Ile Thr Gln 
35 40 45 



Gln His Thr Pro Gln His Thr Thr Thr Pro Pro Thr Pro His His Ser 
50 55 60 



Thr Pro Pro His Asn Thr Thr Pro Ala Pro Pro Pro His Thr Pro Ala 
65 70 75 80 



Pro Pro Thr Thr Arg Pro Thr Thr Pro Pro Pro Thr His Thr His Thr 
85 90 95 



Pro Thr Thr His Thr His Pro Pro Gln His Pro Pro Thr Pro Thr Thr 
100 105 110 



Thr Thr Pro Pro His His Ala Pro Thr Pro His Thr Pro Pro Pro Thr 
115 120 125 



Thr Pro Pro Arg Pro Pro Thr Thr His Thr His Thr Pro Pro His Pro 
130 135 140 



Pro Thr Pro Pro Pro Leu Pro Thr Thr Thr Pro His Pro Thr Ser His 
145 150 155 160 



Ser Thr Leu Ser Pro His His Pro His Ser Thr Thr Ser Ser Leu Pro 
165 170 175 



Ser Thr His Asn Asn Ile Thr Asn Thr Pro Pro Ala His Thr Leu Thr 
180 185 190 



Pro His Thr Ser 
195 



<210> 225 

<211> 92 

<212> PRT 

<213> Homo sapien 

<400> 225 

Met Thr Ser Leu Pro Glu Gly Pro Arg Ala Ser Glu Asp Gly Ala Thr 
1 5 10 15 



Pro Glu Ala Gly Gly Phe Thr Asn Ser Ser His Leu Tyr Arg Arg Pro 
20 25 30 



Ala Arg Cys Gln Ala Cys Trp Gln Ala Gln Gly Lys Ala His Ser Thr 
35 40 45 



Ser Arg His Gly Pro Cys Ser His Gly Ala Tyr Ser Leu Ala Arg Gln 
50 55 60 



Thr Arg Asn Lys Lys Leu Gln Ser Ser Val Glu Val Cys Arg Val Val 
65 70 75 80 



Gly Tyr Ser Asp Leu Ala Leu Tyr Thr His Phe Ala 
85 90 



<210> 226 

<211> 42 

<212> PRT 

<213> Homo sapien 



<400> 226 



Met Lys Ile Tyr Gly Ser Val Phe Gln Asn Asp Glu Glu Phe Gln Asp 
1 5 10 15 



Gly Gly Ser Gly Lys Ile Leu Leu Gln Glu Lys Ser Val Leu Gly Pro 
20 25 30 



Met Cys Lys His Leu Leu Arg Asn Leu Glu 
35 40 



<210> 227 

<211> 57 

<212> PRT 

<213> Homo sapien 

<400> 227 

Met Leu Ser Gln Arg Tyr Arg Lys Val Leu Leu Gly Pro Ser Val Thr 
1 5 10 15 



Leu Ser Phe His Ile Pro Thr Leu His Arg Pro Ser Leu Gln Leu Pro 
20 25 30 



Ala Pro Ala Pro His Cys Arg Ser Pro Gly Phe Cys Leu Glu Leu Asn 
35 40 45 



Glu Glu Met Gly Pro Leu Ala Leu Ala 
50 55 



<210> 228 

<211> 205 

<212> PRT 

<213> Homo sapien 

<400> 228 

Gln Gln Gly Lys Leu Val Ala Asp Ser Ala Lys His Leu Gly Leu Lys 
1 5 10 15 



His Val Val Tyr Ser Gly Leu Glu Asn Val Lys Arg Leu Thr Asp Gly 
20 25 30 



Lys Leu Glu Val Pro His Phe Asp Ser Lys Gly Glu Val Glu Glu Tyr 
35 40 45 



Phe Trp Ser Ile Gly Ile Pro Met Thr Ser Val Arg Val Ala Ala Tyr 
50 55 60 



Phe Glu Asn Phe Leu Ala Ala Trp Arg Pro Val Lys Ala Ser Asp Gly 
65 70 75 80 



Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp Val Pro Met Asp Gly 
85 90 95 



Ile Ser Val Ala Asp Ile Gly Ala Ala Val Ser Ser Ile Phe Asn Ser 
100 105 110 



Pro Glu Glu Phe Leu Gly Lys Ala Val Gly Leu Ser Ala Glu Ala Leu 
115 120 125 



Thr Ile Gln Gln Tyr Ala Asp Val Leu Ser Lys Ala Leu Gly Lys Glu 
130 135 140 



Val Arg Asp Ala Lys Ile Thr Pro Glu Ala Phe Glu Lys Leu Gly Phe 
145 150 155 160 



Pro Ala Ala Lys Glu Ile Ala Asn Met Cys Arg Phe Tyr Glu Met Lys 
165 170 175 



Pro Asp Arg Asp Val Asn Leu Thr His Gln Leu Asn Pro Lys Val Lys 
180 185 190 



Ser Phe Ser Gln Phe Ile Ser Glu Asn Gln Gly Ala Phe 
195 200 205 



<210> 229 

<211> 46 

<212> PRT 

<213> Homo sapien 

<400> 229 

Met Lys Lys Lys Val Leu Ser Ile Ile Cys Ile Ile Gly Ile His Met 
1 5 10 15 



Ser Leu His Lys Met Phe Asn Leu Lys Glu Ile Pro Leu Ile Leu Tyr 
20 25 30 



Val Leu Leu Ser Val Val Cys Phe Ser Phe Leu Ile Leu Ser 
35 40 45 



<210> 230 

<211> 53 

<212> PRT 

<213> Homo sapien 

<400> 230 

Val Ala Gln Ala Gly Val Gln Trp Arg Asn Ala Asn Ser Leu Gln Pro 
1 5 10 15 



Ala Pro Ser Trp Leu Lys Gln Ala Leu His Leu Ser Pro Leu Ser Ser 
20 25 30 



Ala His Tyr Arg His Thr Pro Pro His Pro Ala Asn Phe Phe Glu Phe 
35 40 45 



Leu Glu Thr Gly Phe 
50 



<210> 231 

<211> 30 

<212> PRT 

<213> Homo sapien 

<400> 231 

Met Gly Gln Val Gly Val Arg Gly Pro Gly Glu Val Arg Ala Leu Ser 
1 5 10 15 



Ser Lys Leu Ser Tyr Cys His Val Phe Val Pro Arg Arg Asp 
20 25 30 



<210> 232 

<211> 39 

<212> PRT 

<213> Homo sapien 

<400> 232 

Met Val Phe Leu Gly Glu Leu Lys Thr Phe Ser Leu Val Ser Val Asn 
1 5 10 15 



Gln Arg Ala Phe Ser Leu Phe Leu Leu Leu Ile Pro Ser Ser Pro Val 
20 25 30 



Asn Tyr Phe Ser Phe His Trp 
35 



<210> 233 

<211> 107 

<212> PRT 

<213> Homo sapien 

<400> 233 



Phe Phe Phe Phe Leu Leu Leu Phe Cys Asp Ser Leu Ala Leu Ser Pro 
1 5 10 15 



Arg Leu Gln Cys Ser Gly Thr Ile Ser Ala His Cys Asn Leu Cys Leu 
20 25 30 



Leu Gly Ser Ser Asn Ser Pro Val Ser Ala Ser Trp Val Ala Gly Thr 
35 40 45 



Thr Gly Ala Cys His His Ala Trp Leu Thr Phe Val Phe Leu Val Glu 
50 55 60 



Thr Gly Phe His His Val Gly Gln Ala Gly Leu Glu Phe Leu Thr Ser 
65 70 75 80 



Gly Asp Pro Pro Ala Leu Ala Ser Gln Ser Ala Glu Ile Thr Gly Val 
85 90 95 



Ser His Arg Ala Trp Pro Val Cys Phe Phe Asn 
100 105 



<210> 234 

<211> 57 

<212> PRT 

<213> Homo sapien 

<400> 234 

Met Cys Ile Ile Leu Ser Ala His Ala Val Leu Gln Ala Ser Val Pro 
1 5 10 15 



Leu Ala Val His Val Ser Pro His Ala Arg Ala Gly Pro Ser Trp Ser 
20 25 30 



Ala Leu Val Ser Lys Trp Val Tyr Ala Glu Ala Asp Phe Gln Ser Val 
35 40 45 



Ser Cys Pro Pro Ile Gln His Ser Arg 
50 55 



<210> 235 

<211> 50 

<212> PRT 

<213> Homo sapien 

<400> 235 

Met Lys Val Pro Ala Tyr Ile Asn His Leu Ala Arg Trp Trp Glu Ile 
1 5 10 15 



Leu Cys Ser Ser Asn Val Leu Leu Val Leu Gly Arg Asp Gly Ala His 
20 25 30 



Ser Gly Ala Lys Glu Asp Lys Lys Ser Met Gln Asn Leu Ser Leu Leu 
35 40 45 



Met Ala 
50 



<210> 236 

<211> 44 

<212> PRT 

<213> Homo sapien 

<400> 236 

Met His Asn Trp Asp Cys Trp Asn Gly Pro Arg His Thr Thr Ala Gly 
1 5 10 15 



His Cys His Gln Glu Gly Ala Cys Val Leu Glu Gly Ser Gly Gln His 
20 25 30 



Arg Leu Ala Asn Leu Glu Gly Ser Gln Arg Asp Ser 
35 40 



<210> 237 

<211> 146 

<212> PRT 

<213> Homo sapien 

<400> 237 

Met Gly Ala Arg Val 
1 5 10 15 



Leu Pro Gly Val Gln Ser Gly Ser Pro Leu Ala Asp Leu Met Leu Ser 
20 25 30 



Asp Arg Trp Asp Lys Phe Phe Cys His Ser Ala Gly Leu Cys Pro Glu 
35 40 45 



Ala Ser Leu Leu Ala Gly Cys Ala His Ala Trp Glu Lys Ala Trp Ala 
50 55 60 



Val Asn Tyr Gly His Thr Cys Ser Leu Cys Gly His Cys Ser Pro Ala 
65 70 75 80 



Pro Ile Pro Ile Pro Pro His Pro Thr His Pro Asn Thr His Thr Pro 
85 90 95 



Arg Pro Gln Thr Pro Thr Pro Thr Thr Pro His Pro Pro Thr Pro Thr 
100 105 110 



Pro Pro His Pro Pro Gln His Pro His Pro Arg Pro Pro Pro Thr Ser 
115 120 125 



Thr His Pro Pro Thr His Asn Thr Pro His Thr Thr His His Gln His 
130 135 140 



His His 
145 



<210> 238 

<211> 47 

<212> PRT 

<213> Homo sapien 

<400> 238 

Met Tyr Arg Gln Tyr Gly Pro Trp Cys Thr Asn Ala Ala Ser Gly Arg 
1 5 10 15 



Arg Asp Val Met Asp Gly Arg Gly Arg Gly Thr Phe Asn Pro Ser Ser 
20 25 30 



Pro Phe Pro Pro Ser Gly Ala Ser Tyr Glu Ile Ser Val His Phe 
35 40 45 



<210> 239 

<211> 91 

<212> PRT 

<213> Homo sapien 

<400> 239 

Met Val Lys Ile Ser Phe Gln Pro Ala Val Ala Gly Ile Lys Gly Asp 
1 5 10 15 



Lys Ala Asp Lys Ala Ser Ala Ser Ala Pro Ala Pro Ala Ser Ala Thr 
20 25 30 



Glu Ile Leu Leu Thr Pro Ala Arg Glu Glu Gln Pro Pro Gln His Arg 
35 40 45 



Ser Lys Arg Gly Gly Ser Val Gly Gly Val Cys Tyr Leu Ser Met Gly 
50 55 60 



Met Val Val Leu Leu Met Gly Leu Val Phe Ala Ser Val Tyr Ile Tyr 
65 70 75 80 



Arg Tyr Phe Phe Leu Ala Gln Leu Ala Arg Asp 
85 90 



<210> 240 

<211> 188 

<212> PRT 

<213> Homo sapien 

<400> 240 

Met Arg Leu Val Gly Gly Val Gly Ser Phe Arg Leu Gly Gly Val Gly 
1 5 10 15 



Cys Gly Gly Gly Gly Gly Gly Ala Gly Ala Gly Ser Trp Val Trp Met 
20 25 30 



Gly Gly Trp Gly Gly Gly Ala Gly Ala Leu Trp Val Ala Val Val Gly 
35 40 45 



Gly Ala Arg Trp Trp Gly Gly Ala Gly Trp Gly Ser Cys Gly Arg Val 
50 55 60 



Leu Val Gly Gly Arg Ala Val Val Val Gly Arg Val Gly Val Val Gly 
65 70 75 80 



Trp Gly Trp Trp Arg Val Val Val Ala Gly Cys Val Cys Gly Gly Gly 
85 90 95 



Trp Arg Trp Trp Arg Ala Gly Val Gly Gly Gly Gly Gly Ala Val Ser 
100 105 110 



Gly Pro Ser Gly Ala Gly Pro Gly Arg Arg Cys Ser Met Val Glu Arg 
115 120 125 



Arg Arg Gly His Val Gly Ser Gly Gly Trp Ala Gly Arg Pro Gly Val 
130 135 140 



Val Gly Val Trp Ala Arg Cys Val Leu Val Ala Gly Ala Val Trp Arg 
145 150 155 160 



Arg Gly Gly Ala Val Trp Glu Trp Arg Gly Leu Gly Cys Gly Ala Trp 
165 170 175 



Cys Val Gly Arg Ser Trp Gly Glu Cys Gly Gly Arg 
180 185 



<210> 241 

<211> 110 

<212> PRT 

<213> Homo sapien 

<400> 241 

Met Lys Leu Thr Leu Ser Glu Val Lys Met Glu Val Ile Gly Val Pro 
1 5 10 15 



Trp Arg Asn Gly Ser His Cys Phe Ile Ser Ile Thr Pro Gln Leu Lys 
20 25 30 



Phe Thr Pro Val Ser Gly His Lys Asn Met Arg Lys Glu Pro Cys Cys 
35 40 45 



Phe His Lys Gly Asn His Ser Ser Leu Ser Pro Leu Leu Ile Asn Leu 
50 55 60 



Lys Ser Trp Thr Pro Ser Phe Leu His Trp Pro Arg Pro Thr Leu Thr 
65 70 75 80 



His Leu Glu Pro Leu Phe Arg Ala Glu Trp His Glu Tyr Val Tyr Leu 
85 90 95 



Gly Arg Asp Gln Ser Ile Thr Gln Arg Arg Leu Glu Gln His 
100 105 110 



<210> 242 

<211> 102 

<212> PRT 

<213> Homo sapien 

<400> 242 

Met Pro Ser Leu Pro Thr Arg Ser Leu Leu Ser Pro Cys Val Leu Glu 
1 5 10 15 



Leu Glu Glu Leu Thr Cys Ala Leu Cys Thr Trp Ala Phe Leu Leu Leu 
20 25 30 



Cys Leu Ala Leu Val Ala Asp Cys Pro Gly Leu Arg Gln Val Ile Pro 
35 40 45 



Gly Lys Gln Val Phe Val Leu Phe Ser Met Ser Gly Gly Arg Phe Ile 
50 55 60 



Leu Leu Ser Val Ser Ser His Phe Pro Ile Pro Phe Lys Lys Leu Trp 
65 70 75 80 



Pro Ala Gln Gly Arg Ala Leu Ser Cys Cys Ile Thr Ala Glu Pro Thr 
85 90 95 



Cys Pro His Ala Leu Leu 
100 



<210> 243 

<211> 86 

<212> PRT 

<213> Homo sapien 

<400> 243 

Leu Ala Val Ser Leu Cys His Gln Ala Gly Val Gln Trp Cys Asn Pro 
1 5 10 15 



Gly Ser Leu Gln Pro Pro Pro Pro Gly Phe Lys Arg Phe Phe Cys Leu 
20 25 30 



Cys Leu Pro Ser Ser Trp Gly Tyr Arg His Thr Pro Pro Arg Pro Ala 
35 40 45 



Asn Phe Cys Val Phe Gly Arg Asp Gly Val Ser Pro Cys Trp Pro Gly 
50 55 60 



Trp Ser Leu Ser Leu Asp Val Ile Cys Asp Pro Pro Arg Gln Pro Pro 
65 70 75 80 



Lys Val Leu Gly Leu Gln 
85 



<210> 244 

<211> 53 

<212> PRT 

<213> Homo sapien 

<400> 244 

Met Leu Leu Pro Phe Ala Val Arg Gly Leu Leu Thr Met Ala Arg Gly 
1 5 10 15 



Asp Val Ser Glu Ile Gln Val Val Val Ala Ser Trp Ser Thr Gln Leu 
20 25 30 



Ala His Met Gln Glu Glu Gly Leu Trp Pro Leu Ser Arg Ala Gly Gly 
35 40 45 



Leu Leu Pro Gln Ala 
50 



<210> 245 

<211> 183 

<212> PRT 

<213> Homo sapien 

<400> 245 

Leu Thr Pro Ala Gly Val Pro Trp Cys His Leu Gly Ser Leu Gln Pro 
1 5 10 15 



Leu Pro Pro Arg Phe Lys Ala Val Phe Ser Arg Leu Ala Pro Ser Leu 
20 25 30 



Glu Tyr Ala Trp Asp Tyr Arg Ala Pro Thr Ser His Ala Arg Leu Ile 
35 40 45 



Ser Leu Ala Phe Leu Val Glu Thr Gly Phe Ser Pro Thr Val Ala Arg 
50 55 60 



Leu Val Ser Asn Ser Trp Pro Pro Val Val Arg Pro Pro Leu Pro Ser 
65 70 75 80 



Gln Ser Ala Gly Ile Thr Gly Val Gly Pro Pro Cys Leu Ala Arg Pro 
85 90 95 



Ile Leu Pro Pro His Pro Phe Phe Phe Phe Phe Asp Met Glu Ser His 
100 105 110 



Ala Ile Thr Gln Ala Gly Val Gln Trp Arg His Leu Gly Ser Leu Gln 
115 120 125 



Pro Pro Pro Pro Met Phe Lys Ala Ser Ser Cys Leu Ser Leu Leu Ser 
130 135 140 



Ser Trp Asp Tyr Arg Arg Pro Pro Pro Arg Pro Ala Ile Phe Cys Ile 
145 150 155 160 



Phe Ser Arg Asp Gly Val Ser Pro Cys Ala Pro Gly Trp Ser Arg Ser 
165 170 175 



Pro Asp Leu Thr Pro Asp Leu 
180 



<210> 246 

<211> 12 

<212> PRT 

<213> Homo sapien 

<400> 246 

Met Ala Pro Asp Thr Asn Thr Phe Leu His Pro Phe 
1 5 10 



<210> 247 

<211> 240 

<212> PRT 

<213> Homo sapien 

<400> 247 



Met Gly Asn Cys Gln Ala Gly His Asn Leu His Leu Cys Leu Ala His 
1 5 10 15 



His Pro Pro Leu Val Cys Ala Thr Leu Ile Leu Leu Leu Leu Gly Leu 
20 25 30 



Ser Gly Leu Gly Leu Gly Ser Phe Leu Leu Thr His Arg Thr Gly Leu 
35 40 45 



Arg Ser Pro Asp Ile Pro Gln Asp Trp Val Ser Phe Leu Arg Ser Phe 
50 55 60 



Gly Gln Leu Thr Leu Cys Pro Arg Asn Gly Thr Val Thr Gly Lys Trp 
65 70 75 80 



Arg Gly Ser His Val Val Gly Leu Leu Thr Thr Leu Asn Phe Gly Asp 
85 90 95 



Gly Pro Asp Arg Asn Lys Thr Arg Thr Phe Gln Ala Thr Val Leu Gly 
100 105 110 



Ser Gln Met Gly Leu Lys Gly Ser Ser Ala Gly Gln Leu Val Leu Ile 
115 120 125 



Thr Ala Arg Val Thr Thr Glu Arg Thr Ala Gly Thr Cys Leu Tyr Phe 
130 135 140 



Ser Ala Val Pro Gly Ile Leu Pro Ser Ser Gln Pro Pro Ile Ser Cys 
145 150 155 160 



Ser Glu Glu Gly Ala Gly Asn Ala Thr Leu Ser Pro Arg Met Gly Glu 
165 170 175 



Glu Cys Val Ser Val Trp Ser His Glu Gly Leu Val Leu Thr Lys Leu 
180 185 190 



Leu Thr Ser Glu Glu Leu Ala Leu Cys Gly Ser Arg Leu Leu Val Leu 
195 200 205 



Gly Ser Phe Leu Leu Leu Phe Cys Gly Leu Leu Cys Cys Val Thr Ala 
210 215 220 



Met Cys Phe His Pro Arg Arg Glu Ser His Trp Ser Arg Thr Arg Leu 
225 230 235 240 



<210> 248 

<211> 75 

<212> PRT 

<213> Homo sapien 

<400> 248 

Met Arg Arg Ala Val 
1 5 10 15 



Tyr Thr Gln Glu Ala Arg Arg Tyr Phe Phe Phe Ser Glu Leu Ser Pro 
20 25 30 



Gly Ser Lys Gly Glu Ala Met Gly Asp Pro Gly Met Val Leu Ala Ser 
35 40 45 



Gly Gly Cys Phe Leu Val Thr Gly Val Ser Ser Lys Gln Asn Gly Ile 
50 55 60 



Arg Met Lys Arg Gly Lys Gly Met Gly His Lys 
65 70 75 



<210> 249 

<211> 594 

<212> PRT 

<213> Homo sapien 

<400> 249 

Val Pro Gly Arg Lys Leu His Arg Ser Arg Leu Gln Ala Ala Ala Pro 
1 5 10 15 



Arg Pro Ser Thr Cys Ala Gln Ser Leu Cys Trp Ser Arg Pro Pro Ala 
20 25 30 



Ala Gly Thr Gly Thr Gly Asp Pro Ser Gln Ser Lys Ala Pro Thr Met 
35 40 45 



Ala Met Gly Leu Phe Arg Val Cys Leu Val Val Val Thr Ala Ile Ile 
50 55 60 



Asn His Pro Leu Leu Phe Pro Arg Glu Asn Ala Thr Val Pro Glu Asn 
65 70 75 80 



Glu Glu Glu Ile Ile Arg Lys Met Gln Ala His Gln Glu Lys Leu Gln 
85 90 95 



Leu Glu Gln Leu Arg Leu Glu Glu Glu Val Ala Arg Leu Ala Ala Glu 
100 105 110 



Lys Glu Ala Leu Glu Gln Val Ala Glu Glu Gly Arg Gln Gln Asn Glu 
115 120 125 



Thr Arg Val Ala Trp Asp Leu Trp Ser Thr Leu Cys Met Ile Leu Phe 
130 135 140 



Leu Met Ile Glu Val Trp Arg Gln Asp His Gln Glu Gly Pro Ser Pro 
145 150 155 160 



Glu Cys Leu Gly Gly Glu Glu Asp Glu Leu Pro Gly Leu Gly Gly Ala 
165 170 175 



Pro Leu Gln Gly Leu Thr Leu Pro Asn Lys Ala Thr Leu Gly His Phe 
180 185 190 



Tyr Glu Arg Cys Ile Arg Gly Ala Thr Ala Asp Ala Ala Arg Thr Arg 
195 200 205 



Glu Phe Leu Glu Gly Phe Val Asp Asp Leu Leu Glu Ala Leu Arg Ser 
210 215 220 



Leu Cys Asn Arg Asp Thr Asp Met Glu Val Glu Asp Phe Ile Gly Val 
225 230 235 240 



Asp Ser Met Tyr Glu Asn Trp Gln Val Asp Arg Pro Leu Leu Cys His 
245 250 255 



Leu Phe Val Pro Phe Thr Pro Pro Glu Pro Tyr Arg Phe His Pro Glu 
260 265 270 



Leu Trp Cys Ser Gly Arg Ser Val Pro Leu Asp Arg Gln Gly Tyr Gly 
275 280 285 



Gln Ile Lys Val Val Arg Ala Asp Gly Asp Thr Leu Ser Cys Ile Cys 
290 295 300 



Gly Lys Thr Lys Leu Gly Glu Asp Met Leu Cys Leu Leu His Gly Arg 
305 310 315 320 



Asn Ser Met Ala Pro Pro Cys Gly Asp Met Glu Asn Leu Leu Cys Ala 
325 330 335 



Thr Asp Ser Leu Tyr Leu Asp Thr Met Gln Val Met Lys Trp Phe Gln 
340 345 350 



Thr Ala Leu Thr Arg Ala Trp Lys Gly Ile Ala His Lys Tyr Glu Phe 
355 360 365 



Asp Leu Ala Phe Gly Gln Leu Asp Ser Pro Gly Ser Leu Lys Ile Lys 
370 375 380 



Phe Arg Ser Gly Lys Phe Met Pro Phe Asn Leu Ile Pro Val Ile Gln 
385 390 395 400 



Cys Asp Asp Ser Asp Leu Tyr Phe Val Ser His Leu Pro Arg Glu Pro 
405 410 415 



Ser Glu Gly Thr Pro Ala Ser Ser Thr Asp Trp Leu Leu Ser Phe Ala 
420 425 430 



Val Tyr Glu Arg His Phe Leu Arg Thr Thr Leu Lys Ala Leu Pro Glu 
435 440 445 



Gly Ala Cys His Leu Ser Cys Leu Gln Ile Ala Ser Phe Leu Leu Ser 
450 455 460 



Lys Gln Ser Arg Leu Thr Gly Pro Ser Gly Leu Ser Ser Tyr His Leu 
465 470 475 480 



Lys Thr Ala Leu Leu His Leu Leu Leu Leu Arg Gln Ala Ala Asp Trp 
485 490 495 



Lys Ala Gly Gln Leu Asp Ala Arg Leu His Glu Leu Leu Cys Phe Leu 
500 505 510 



Glu Lys Ser Leu Leu Gln Lys Lys Leu His His Phe Phe Ile Gly Asn 
515 520 525 



Arg Lys Val Pro Glu Ala Met Gly Leu Pro Glu Ala Val Leu Arg Ala 
530 535 540 



Glu Pro Leu Asn Leu Phe Arg Pro Phe Val Leu Gln Arg Ser Leu Tyr 
545 550 555 560 



Arg Lys Thr Leu Asp Ser Phe Tyr Glu Met Leu Lys Asn Ala Pro Ala 
565 570 575 



Leu Ile Ser Glu Tyr Ser Leu His Val Pro Ser Asp Gln Pro Thr Pro 
580 585 590 



Lys Ser 



<210> 250 

<211> 23 

<212> PRT 

<213> Homo sapien 

<400> 250 

Met Tyr Cys Ile Gly Gly Trp Ala Gly Pro Thr Leu Cys Tyr Val Lys 
1 5 10 15 



Glu Leu Val Leu Val Leu Gly 
20 



<210> 251 
<211> 213 
<212> PRT 

<213> Homo sapien 

<400> 251 

Ser Pro His Gln Ala Ala Ala Pro Val Asp Gln Thr Pro Arg Thr Leu 
1 5 10 15 



Ala Thr Met Gly Gln Arg Ala Leu Pro Ser Ser Leu Ala Leu Leu Ser 
20 25 30 



Arg Pro Leu Ser Pro Pro Pro Ala Ala Cys Ser Gly Asp Pro Gly Cys 
35 40 45 



Gly Ser Gly Ala Gly Leu Pro Ser Ala Ser Ala Ala Ala Gly Ile Ala 
50 55 60 



Ser Ser Ala Val Glu Ala Val Cys Gly Asp Ala Ala Pro Ala Cys Leu 
65 70 75 80 



Leu Arg Thr Pro Leu Arg Gly Leu Leu Lys Pro Thr Gly Pro Arg Ser 
85 90 95 



Thr Met Glu Cys Pro Pro Ala Leu Ile Val His Pro Pro Thr Gly Gly 
100 105 110 



Met Ala Arg Arg Ala Ala Ser Gln Pro Trp Ala Ala Ala Ser Ala Thr 
115 120 125 



Pro Met Leu Ser Ser Lys Ala Ser Leu Cys Ile Pro Thr Glu Arg Pro 
130 135 140 



Pro Pro Gln Pro Leu Met Arg Thr Pro Ala Ala Arg Ser His Trp Pro 
145 150 155 160 



Ile Pro His Pro Ala Ser Thr Ala Cys Pro Ala Pro Leu Pro Val Val 
165 170 175 



Leu Val Ala Pro Arg Ser Thr Ile Leu Ser Met Ser Arg Thr Trp Thr 
180 185 190 



Cys Arg Arg Trp Ala Val Ala Pro Cys Arg Ala Glu Lys Leu Met Cys 
195 200 205 



Ser Ser Ser Arg Ser 
210 



<210> 252 

<211> 32 

<212> PRT 

<213> Homo sapien 

<400> 252 

Met His Glu Leu Thr Ala Arg Leu Thr Gln Pro Leu Asn Ser Gly Ser 
1 5 10 15 



Cys Phe Ser Leu Ala Ala Ile His His Met Arg Arg Arg Ser Met His 
20 25 30 



<210> 253 

<211> 58 

<212> PRT 

<213> Homo 

<400> 253 



Met Ser Leu Gln Leu Gln Ile Leu Asn Val Ser Pro Val Ile Trp His 
1 5 10 15 



Phe Arg His Ser Tyr Leu Lys Pro Gln Phe Ser Leu Pro Val Lys Trp 
20 25 30 



Gly Ile Ile Ile Pro Ile Leu Pro Arg Leu Leu Lys Gly Leu Ser Glu 
35 40 45 



Leu Ile Cys Lys Met Leu Asn Arg Thr Gln 
50 55 



<210> 254 

<211> 34 

<212> PRT 

<213> Homo sapien 

<400> 254 

Met Gly Ser Ala Phe Val Leu Leu Ser Trp Arg Ala Cys Leu Cys Cys 
1 5 10 15 



Arg Ala Val Ser Val Val Gly Ile Ala Leu Leu Phe Pro Ala Thr Gly 
20 25 30 



Gln Ile 



<210> 255 

<211> 74 

<212> PRT 

<213> Homo sapien 

<400> 255 

Lys Arg Phe Phe Phe Phe Pro Ala Pro Ile Phe Cys Lys Thr Glu Val 
1 5 10 15 



Pro Glu His Arg Arg Ser Ser Ser Gln Ala Asn Phe Ile Lys Lys Lys 
20 25 30 



Leu Glu Val Cys Phe Asp Phe Ala Val Ile Cys Phe Ile Thr Ser Ile 
35 40 45 



Phe Gly Glu Gln Pro Gln Leu Leu Ile Phe Met Glu Lys Tyr Phe Gln 
50 55 60 



Val Gln Gly Gln Tyr Ile Ser Gln Ser Glu 
65 70 



<210> 256 

<211> 34 

<212> PRT 

<213> Homo sapien 

<400> 256 

Met Ile Lys Val Cys Val Pro Ile Thr Phe Pro Leu Pro Glu Arg Arg 
1 5 10 15 



Val Ser Arg Lys Ile Asn Ser Ile Leu Asp Ala Gly Thr Ser Pro Arg 
20 25 30 



Pro Arg 



<210> 257 

<211> 37 

<212> PRT 

<213> Homo sapien 

<400> 257 

Met Asn Ser Ser Asn Arg Arg Leu Phe Trp Lys Lys Ser Gln Gly Leu 
1 5 10 15 



Ser Pro Ser Trp Val Ala Pro Tyr Lys Ser Asn Ser Ser Ser Gly Ser 
20 25 30 



Leu Val Tyr Pro Leu 
35 



<210> 258 

<211> 73 

<212> PRT 

<213> Homo sapien 

<400> 258 

Met Glu Phe Leu Leu 
1 5 10 15 



Asp Val Ile Pro Thr Arg Gly Leu Arg Gly Lys Leu Lys Asp Ile Lys 
20 25 30 



Gln Ser Asn Leu Val Ile Val Lys Thr Ile Tyr Val Gly His Arg Thr 
35 40 45 



Glu Asp Gln Val Ser Lys Glu Asp Gly Ser Val Pro Phe Val Ser Pro 
50 55 60 



Val Pro Lys Ala Val Phe Gly Ala Ser 
65 70 



<210> 259 

<211> 1533 

<212> PRT 

<213> Homo sapien 

<400> 259 



Met Tyr Ile Arg Val Ser Tyr Asp Thr Lys Pro Asp Ser Leu Leu His 
1 5 10 15 



Leu Met Val Lys Asp Trp Gln Leu Glu Leu Pro Lys Leu Leu Ile Ser 
20 25 30 



Val His Gly Gly Leu Gln Asn Phe Glu Met Gln Pro Lys Leu Lys Gln 
35 40 45 



Val Phe Gly Lys Gly Leu Ile Lys Ala Ala Met Thr Thr Gly Ala Trp 
50 55 60 



Ile Phe Thr Gly Gly Val Ser Thr Gly Val Ile Ser His Val Gly Asp 
65 70 75 80 



Ala Leu Lys Asp His Ser Ser Lys Ser Arg Gly Arg Val Cys Ala Ile 
85 90 95 



Gly Ile Ala Pro Trp Gly Ile Val Glu Asn Lys Glu Asp Leu Val Gly 
100 105 110 



Lys Asp Val Thr Arg Val Tyr Gln Thr Met Ser Asn Pro Leu Ser Lys 
115 120 125 



Leu Ser Val Leu Asn Asn Ser His Thr His Phe Ile Leu Ala Asp Asn 
130 135 140 



Gly Thr Leu Gly Lys Tyr Gly Ala Glu Val Lys Leu Arg Arg Leu Leu 
145 150 155 160 



Glu Lys His Ile Ser Leu Gln Lys Ile Asn Thr Arg Leu Gly Gln Gly 
165 170 175 



Val Pro Leu Val Gly Leu Val Val Glu Gly Gly Pro Asn Val Val Ser 
180 185 190 



Ile Val Leu Glu Tyr Leu Gln Glu Glu Pro Pro Ile Pro Val Val Ile 
195 200 205 



Cys Asp Gly Ser Gly Arg Ala Ser Asp Ile Leu Ser Phe Ala His Lys 
210 215 220 



Tyr Cys Glu Glu Gly Gly Ile Ile Asn Glu Ser Leu Arg Glu Gln Leu 
225 230 235 240 



Leu Val Thr Ile Gln Lys Thr Phe Asn Tyr Asn Lys Ala Gln Ser His 
245 250 255 



Gln Leu Phe Ala Ile Ile Met Glu Cys Met Lys Lys Lys Glu Leu Val 
260 265 270 



Thr Val Phe Arg Met Gly Ser Glu Gly Gln Gln Asp Ile Glu Met Ala 
275 280 285 



Ile Leu Thr Ala Leu Leu Lys Gly Thr Asn Val Ser Ala Pro Asp Gln 
290 295 300 



Leu Ser Leu Ala Leu Ala Trp Asn Arg Val Asp Ile Ala Arg Ser Gln 
305 310 315 320 



Ile Phe Val Phe Gly Pro His Trp Thr Pro Leu Gly Ser Leu Ala Pro 
325 330 335 



Pro Thr Asp Ser Lys Ala Thr Glu Lys Glu Lys Lys Pro Pro Met Ala 
340 345 350 



Thr Thr Lys Gly Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Gly Lys 
355 360 365 



Val Lys Glu Glu Val Glu Glu Glu Thr Asp Pro Arg Lys Ile Glu Leu 
370 375 380 



Leu Asn Trp Val Asn Ala Leu Glu Gln Ala Met Leu Asp Ala Leu Val 
385 390 395 400 



Leu Asp Arg Val Asp Phe Val Lys Leu Leu Ile Glu Asn Gly Val Asn 
405 410 415 



Met Gln His Phe Leu Thr Ile Pro Arg Leu Glu Glu Leu Tyr Asn Thr 
420 425 430 



Arg Leu Gly Pro Pro Asn Thr Leu His Leu Leu Val Arg Asp Val Lys 
435 440 445 



Lys Ser Asn Leu Pro Pro Asp Tyr His Ile Ser Leu Ile Asp Ile Gly 
450 455 460 



Leu Val Leu Glu Tyr Leu Met Gly Gly Ala Tyr Arg Cys Asn Tyr Thr 
465 470 475 480 



Arg Lys Asn Phe Arg Thr Leu Tyr Asn Asn Leu Phe Gly Pro Lys Arg 
485 490 495 



Pro Lys Ala Leu Lys Leu Leu Gly Met Glu Asp Asp Glu Pro Pro Ala 
500 505 510 



Lys Gly Lys Lys Lys Lys Lys Lys Lys Lys Glu Glu Glu Ile Asp Ile 
515 520 525 



Asp Val Asp Asp Pro Ala Val Ser Arg Phe Gln Tyr Pro Phe His Glu 
530 535 540 



Leu Met Val Trp Ala Val Leu Met Lys Arg Gln Lys Met Ala Val Phe 
545 550 555 560 



Leu Trp Gln Arg Gly Glu Glu Ser Met Ala Lys Ala Leu Val Ala Cys 
565 570 575 



Lys Leu Tyr Lys Ala Met Ala His Glu Ser Ser Glu Ser Asp Leu Val 
580 585 590 



Asp Asp Ile Ser Gln Asp Leu Asp Asn Asn Ser Lys Asp Phe Gly Gln 
595 600 605 



Leu Ala Leu Glu Leu Leu Asp Gln Ser Tyr Lys His Asp Glu Gln Ile 
610 615 620 



Ala Met Lys Leu Leu Thr Tyr Glu Leu Lys Asn Trp Ser Asn Ser Thr 
625 630 635 640 



Cys Leu Lys Leu Ala Val Ala Ala Lys His Arg Asp Phe Ile Ala His 
645 650 655 



Thr Cys Ser Gln Met Leu Leu Thr Asp Met Trp Met Gly Arg Leu Arg 
660 665 670 



Met Arg Lys Asn Pro Gly Leu Lys Val Ile Met Gly Ile Leu Leu Pro 
675 680 685 



Pro Thr Ile Leu Phe Leu Glu Phe Arg Thr Tyr Asp Asp Phe Ser Tyr 
690 695 700 



Gln Thr Ser Lys Glu Asn Glu Asp Gly Lys Glu Lys Glu Glu Glu Asn 
705 710 715 720 



Thr Asp Ala Asn Ala Asp Ala Gly Ser Arg Lys Gly Asp Glu Glu Asn 
725 730 735 



Glu His Lys Lys Gln Arg Ser Ile Pro Ile Gly Thr Lys Ile Cys Glu 
740 745 750 



Phe Tyr Asn Ala Pro Ile Val Lys Phe Trp Phe Tyr Thr Ile Ser Tyr 
755 760 765 



Leu Gly Tyr Leu Leu Leu Phe Asn Tyr Val Ile Leu Val Arg Met Asp 
770 775 780 



Gly Trp Pro Ser Leu Gln Glu Trp Ile Val Ile Ser Tyr Ile Val Ser 
785 790 795 800 



Leu Ala Leu Glu Lys Ile Arg Glu Ile Leu Met Ser Glu Pro Gly Lys 
805 810 815 



Leu Ser Gln Lys Ile Lys Val Trp Leu Gln Glu Tyr Trp Asn Ile Thr 
820 825 830 



Asp Leu Val Ala Ile Ser Thr Phe Met Ile Gly Ala Ile Leu Arg Leu 
835 840 845 



Gln Asn Gln Pro Tyr Met Gly Tyr Gly Arg Val Ile Tyr Cys Val Asp 
850 855 860 



Ile Ile Phe Trp Tyr Ile Arg Val Leu Asp Ile Phe Gly Val Asn Lys 
865 870 875 880 



Tyr Leu Gly Pro Tyr Val Met Met Ile Gly Lys Met Met Ile Asp Met 
885 890 895 



Leu Tyr Phe Val Val Ile Met Leu Val Val Leu Met Ser Phe Gly Val 
900 905 910 



Ala Arg Gln Ala Ile Leu His Pro Glu Glu Lys Pro Ser Trp Lys Leu 
915 920 925 



Ala Arg Asn Ile Phe Tyr Met Pro Tyr Trp Met Ile Tyr Gly Glu Val 
930 935 940 



Phe Ala Asp Gln Ile Asp Leu Tyr Ala Met Glu Ile Asn Pro Pro Cys 
945 950 955 960 



Gly Glu Asn Leu Tyr Asp Glu Glu Gly Lys Arg Leu Pro Pro Cys Ile 
965 970 975 



Pro Gly Ala Trp Leu Thr Pro Ala Leu Met Ala Cys Tyr Leu Leu Val 
980 985 990 



Ala Asn Ile Leu Leu Val Asn Leu Leu Ile Ala Val Phe Asn Asn Thr 
995 1000 1005 



Phe Phe Glu Val Lys Ser Ile Ser Asn Gln Val Trp Lys Phe Gln 
1010 1015 1020 



Arg Tyr Gln Leu Ile Met Thr Phe His Asp Arg Pro Val Leu Pro 
1025 1030 1035 



Pro Pro Met Ile Ile Leu Ser His Ile Tyr Ile Ile Ile Met Arg 
1040 1045 1050 



Leu Ser Gly Arg Cys Arg Lys Lys Arg Glu Gly Asp Gln Glu Glu 
1055 1060 1065 



Arg Asp Arg Gly Leu Lys Leu Phe Leu Ser Asp Glu Glu Leu Lys 
1070 1075 1080 



Arg Leu His Glu Phe Glu Glu Gln Cys Val Gln Glu His Phe Arg 
1085 1090 1095 



Glu Lys Glu Asp Glu Gln Gln Ser Ser Ser Asp Glu Arg Ile Arg 
1100 1105 1110 



Val Thr Ser Glu Arg Val Glu Asn Met Ser Met Arg Leu Glu Glu 
1115 1120 1125 



Ile Asn Glu Arg Glu Thr Phe Met Lys Thr Ser Leu Gln Thr Val 
1130 1135 1140 



Asp Leu Arg Leu Ala Gln Leu Glu Glu Leu Ser Asn Arg Met Val 
1145 1150 1155 



Asn Ala Leu Glu Asn Leu Ala Gly Ile Asp Arg Ser Asp Leu Ile 
1160 1165 1170 



Gln Ala Arg Ser Arg Ala Ser Ser Glu Cys Glu Ala Thr Tyr Leu 
1175 1180 1185 



Leu Arg Gln Ser Ser Ile Asn Ser Ala Asp Gly Tyr Ser Leu Tyr 
1190 1195 1200 



Arg Tyr His Phe Asn Gly Glu Glu Leu Leu Phe Glu Asp Thr Ser 
1205 1210 1215 



Leu Ser Thr Ser Pro Gly Thr Gly Val Arg Lys Lys Thr Cys Ser 
1220 1225 1230 



Phe Arg Ile Lys Glu Glu Lys Asp Val Lys Thr His Leu Val Pro 
1235 1240 1245 



Glu Cys Gln Asn Ser Leu His Leu Ser Leu Gly Thr Ser Thr Ser 
1250 1255 1260 



Ala Thr Pro Asp Gly Ser His Leu Ala Val Asp Asp Leu Lys Asn 
1265 1270 1275 



Ala Glu Glu Ser Lys Leu Gly Pro Asp Ile Gly Ile Ser Lys Glu 
1280 1285 1290 



Asp Asp Glu Arg Gln Thr Asp Ser Lys Lys Glu Glu Thr Ile Ser 
1295 1300 1305 



Pro Ser Leu Asn Lys Thr Asp Val Ile His Gly Gln Asp Lys Ser 
1310 1315 1320 



Asp Val Gln Asn Thr Gln Leu Thr Val Glu Thr Thr Asn Ile Glu 
1325 1330 1335 



Gly Thr Ile Ser Tyr Pro Leu Glu Glu Thr Lys Ile Thr Arg Tyr 
1340 1345 1350 



Phe Pro Asp Glu Thr Ile Asn Ala Cys Lys Thr Met Lys Ser Arg 
1355 1360 1365 



Ser Phe Val Tyr Ser Arg Gly Arg Lys Leu Val Gly Gly Val Asn 
1370 1375 1380 



Gln Asp Val Glu Tyr Ser Ser Ile Thr Asp Gln Gln Leu Thr Thr 
1385 1390 1395 



Glu Trp Gln Cys Gln Val Gln Lys Ile Thr Arg Ser His Ser Thr 
1400 1405 1410 



Asp Ile Pro Tyr Ile Val Ser Glu Ala Ala Val Gln Ala Glu Gln 
1415 1420 1425 



Lys Glu Gln Phe Ala Asp Met Gln Asp Glu His His Val Ala Glu 
1430 1435 1440 



Ala Ile Pro Arg Ile Pro Arg Leu Ser Leu Thr Ile Thr Asp Arg 
1445 1450 1455 



Asn Gly Met Glu Asn Leu Leu Ser Val Lys Pro Asp Gln Thr Leu 
1460 1465 1470 



Gly Phe Pro Ser Leu Arg Ser Lys Ser Leu His Gly His Pro Arg 
1475 1480 1485 



Asn Val Lys Ser Ile Gln Gly Lys Leu Asp Arg Ser Gly His Ala 
1490 1495 1500 



Ser Ser Val Ser Ser Leu Val Ile Val Ser Gly Met Thr Ala Glu 
1505 1510 1515 



Glu Lys Lys Val Lys Lys Glu Lys Ala Ser Thr Glu Thr Glu Cys 
1520 1525 1530 



<210> 260 

<211> 92 

<212> PRT 

<213> Homo sapien 

<400> 260 

Met Ile Ile Leu Val Val Gly Arg Ile Thr Arg Gly Asn Ala Leu Tyr 
1 5 10 15 



Ser Gln Glu Glu Cys Cys Val Cys Thr Thr Gln Leu Thr Thr Trp Val 
20 25 30 



Val Cys Ser Thr Leu His Cys Val Ser Ile Leu Trp Ser Val Arg Pro 
35 40 45 



Ser Leu Ser Glu Gly Gly Tyr Leu Pro Leu Ala Ala Ser Val Ser Ala 
50 55 60 



Ala Ile Val Val Cys Phe Val Cys Val Cys Val Val Ser Cys His Asp 
65 70 75 80 



Ala Thr Ile Leu Leu Arg Ile Gly Asn Phe Gly Gly 
85 90 



<210> 261 

<211> 66 

<212> PRT 

<213> Homo sapien 

<400> 261 

Met Glu Leu Leu Thr Asp Lys Gly Glu Ile Leu Asp Leu Glu Pro Phe 
1 5 10 15 



Pro Ala Ile Leu Leu Phe Ser Leu Cys Leu Gly Ser Trp Phe His Ser 
20 25 30 



Ala Arg His Glu Gly Pro Phe Gln Phe Asp Asp Ile Arg Leu Leu Thr 
35 40 45 



Leu Ser Trp Met Pro Cys Cys Leu Gln Gln His Asp Phe Thr Val Cys 
50 55 60 



Phe Ser 
65 



<210> 262 

<211> 90 

<212> PRT 

<213> Homo 

<400> 262 

Met Trp Asn 
1 10 15 



Ser Pro Glu Pro Ser His Pro Gly Ser Val Arg Val Pro Arg Ala Val 
20 25 30 



Ala Pro His Pro Pro Pro Thr Gly Pro Cys Ser Trp Ser Cys Cys Asp 
35 40 45 



Ser Phe Ile Ile Pro Trp Ala Gly Val Gly Leu Ser Leu Cys Phe Cys 
50 55 60 



Leu Leu Phe Lys Glu Asp Glu Val Ser Met Glu Asn Lys Thr Asn Val 
65 70 75 80 



Val Thr Pro Ser Leu Arg Arg Val His Cys 
85 90 



<210> 263 

<211> 13 

<212> PRT 

<213> Homo sapien 

<400> 263 

Met Ser Gly Gln Pro Arg Pro Thr Ser Pro Cys Val Leu 
1 5 10 



<210> 264 

<211> 100 

<212> PRT 

<213> Homo sapien 

<400> 264 

Phe Phe Leu Arg Trp Ser Leu Ala Gln Val Ala Gln Ala Ala Arg Gln 
1 5 10 15 



Trp Leu Asn Leu Ser Ser Leu Gln Pro Pro Pro Pro Gly Phe Lys Arg 
20 25 30 



Phe Ser Cys Leu Gly Leu Leu Ser Ser Trp Asp Tyr Arg His Ala Pro 
35 40 45 



Pro Arg Pro Ala Ile Phe Val Phe Leu Val Glu Met Gly Phe His His 
50 55 60 



Ile Val Gln Ala Gly Leu Lys Pro Leu Thr Ser Gly Asp Leu Ala Thr 
65 70 75 80 



Ser Ala Phe Gln Ser Ala Glu Ile Ile Gly Val Ser His Cys Ala Gln 
85 90 95 



Pro Gln Lys Ser 
100 



<210> 265 

<211> 10 

<212> PRT 

<213> Homo sapien 

<400> 265 

Met Lys Gly Lys Ile 
1 5 10 



<210> 266 

<211> 43 

<212> PRT 

<213> Homo sapien 

<400> 266 



Met Ser Pro Glu Pro Ser His Phe Ser Pro Pro Ala Pro Pro Ser Phe 
1 5 10 15 



Ser Pro Thr His Pro Ser Leu Pro Leu Thr Trp Ile Ser Ala Pro Ala 
20 25 30 



Ala Ser Pro Leu Pro Leu Leu Leu Pro Thr Phe 
35 40 



<210> 267 

<211> 124 

<212> PRT 

<213> Homo sapien 

<400> 267 

Met Val Phe Tyr Cys Ile Leu Phe Leu Gln Leu Ile Gln Phe Cys Met 
1 5 10 15 



Ser Phe Leu Ser Phe Leu Gly Glu Asn Ile Leu Cys Gln Leu Phe Ser 
20 25 30 



Thr Val Leu His Tyr Ile Leu Lys Gln Gly Cys Gln Leu Glu Thr Gln 
35 40 45 



Pro Ser Asp Tyr Lys Ala Gln Asn Val Thr Phe Asn Cys Ala Pro Pro 
50 55 60 



Gly Gly Leu Ala Leu Gly Lys Asp Gly Glu Arg Asn Ile Leu Arg Tyr 
65 70 75 80 



Glu His Phe Leu Phe Cys Leu Gln Cys Cys Asp Leu Val Gln Gln Leu 
85 90 95 



Gln Asn Cys Ser His Leu Asn Arg Cys Ser Phe Ser Phe Phe Thr Leu 
100 105 110 



Leu Tyr Lys Arg Leu Val Ser Gln Leu His Tyr His 
115 120 



<210> 268 

<211> 67 

<212> PRT 

<213> Homo sapien 

<400> 268 

Met Pro Glu Phe His Pro His Ser Leu Glu Leu Phe Thr Tyr Ser Pro 
1 5 10 15 



Ser Gln Glu Leu Leu Asp Glu His Gln Glu Met Arg Phe Lys Tyr Asn 
20 25 30 



Thr Glu Lys Cys Ala Gln Ala Gly Tyr His Pro Cys Trp Asn Leu Ala 
35 40 45 



Leu Ala Asn Trp Ala Thr Arg Val Pro Ala Arg Ala Asp Pro Ser Gln 
50 55 60 



Ser Ala Gly 
65 



<210> 269 

<211> 23 

<212> PRT 

<213> Homo sapien 

<400> 269 

Met Thr Asp Leu Lys Glu Asn Ser Lys Ala Asp Leu Glu Asn Leu Leu 
1 5 10 15 



Leu Phe Leu Ser Pro Asn Pro 
20 



<210> 270 

<211> 46 

<212> PRT 

<213> Homo sapien 

<400> 270 

Met Glu Asn Leu Ser Ser Ile Ser Glu Val Val Asn Ala Ile Ser Gly 
1 5 10 15 



Ile Gln Arg Leu Ala Val Lys Ser Ser Leu Gly Ser Leu Tyr Leu Thr 
20 25 30 



Phe Phe Leu Val Ser Ile Leu Lys Met Gln Ser His Ile Leu 
35 40 45 



<210> 271 

<211> 15 

<212> PRT 

<213> Homo sapien 

<400> 271 

Met Thr Glu Glu Gly Glu Ser Leu Ser Gly Gln Ser Leu Gly Trp 
1 5 10 15 



<210> 272 

<211> 46 

<212> PRT 

<213> Homo sapien 

<400> 272 

Met Pro Ser Ala Arg Met Ser Asp Gly Leu Val Ala Ala Glu Val Gln 
1 5 10 15 



Ser Pro Val Ile Phe Leu Phe Gly Pro Ile Trp Leu Leu Ile Leu Met 
20 25 30 



35 40 45 



<210> 273 

<211> 32 

<212> PRT 

<213> Homo 

<400> 273 



Met Gly Arg Ala Leu Pro Leu Ser Ala Ala Pro Ser Leu Ser Leu Cys 
1 5 10 15 



Leu Pro Ala Gln Lys Arg Trp Leu Trp Pro Arg Gly Ser Gly Arg Asp 
20 25 30 



<210> 274 

<211> 224 

<212> PRT 

<213> Homo sapien 

<400> 274 

Met Ala Val Gly Asn Ile Asn Glu Leu Pro Glu Asn Ile Leu Leu Glu 
1 5 10 15 



Leu Phe Thr His Val Pro Ala Arg Gln Leu Leu Leu Asn Cys Arg Leu 
20 25 30 



Val Cys Ser Leu Trp Arg Asp Leu Ile Asp Leu Val Thr Leu Trp Lys 
35 40 45 



Arg Lys Cys Leu Arg Glu Gly Phe Ile Thr Glu Asp Trp Asp Gln Pro 
50 55 60 



Val Ala Asp Trp Lys Ile Phe Tyr Phe Leu Arg Ser Leu His Arg Asn 
65 70 75 80 



Leu Leu His Asn Pro Cys Ala Glu Glu Gly Phe Glu Phe Trp Ser Leu 
85 90 95 



Asp Val Asn Gly Gly Asp Glu Trp Lys Val Glu Asp Leu Ser Arg Asp 
100 105 110 



Gln Arg Lys Glu Phe Pro Asn Asp Gln Val Arg Ser Gln Ala Arg Leu 
115 120 125 



Arg Val Gln Val Pro Ala Val Arg Ser Ala Pro Val Val Arg Ala Arg 
130 135 140 



Ala Ser Gly Asp Leu Pro Ala Arg Pro Gly Asp His Pro Ala Glu Glu 
145 150 155 160 



Arg Cys Gln Val Glu Gly Gly Leu Pro His Ile Leu Gln Leu Pro Ala 
165 170 175 



Arg Arg Pro Leu His Leu Val Ser Ala Arg Arg Arg Gly His Ser Leu 
180 185 190 



Leu Gly Arg Leu Val Arg Pro Glu Gly His Gln Gln Gln His His His 
195 200 205 



Arg Ala Pro Ala Ala Leu Thr Pro Pro Glu Pro Pro Ser Ala Glu Pro 
210 215 220 



<210> 275 

<211> 33 

<212> PRT 

<213> Homo sapien 

<400> 275 

Met Gly Gly Gln Ala Thr Arg Tyr Tyr Ile Ile Asn Ile Leu Ser Gly 
1 5 10 15 



Lys Ile Ser Leu Phe Arg Ala Ile Arg Gln Val Ala Lys Asn Phe Ile 
20 25 30 



Leu 



<210> 276 
<211> 77 
<212> PRT 

<213> Homo sapien 

<400> 276 

Met Asn Gly Lys Thr Lys Val Glu Arg Asn Ile Leu Ser Tyr Ile Ile 
1 5 10 15 



Leu Gln Ile Lys Thr Phe Lys Asn Gln Ile Val Phe Leu Val Leu Arg 
20 25 30 



Thr Asn Arg Lys Cys Leu Ile Ile Tyr Phe Ile Ser Thr Arg Gln Lys 
35 40 45 



Tyr Ser Tyr Ala Ala Asp Val Arg Glu Gly Gly Glu Phe Pro Gln Pro 
50 55 60 



Ser Met Lys Lys Asp Lys Gly Pro Tyr Pro Leu Ala Val 
65 70 75 



<210> 277 

<211> 39 

<212> PRT 

<213> Homo sapien 

<400> 277 

Met Tyr Val Arg Ser Ile His Leu Lys Ser Met Val Gln Ile Ala Lys 
1 5 10 15 



Ile Gly Pro Gly Glu Thr Cys Ser His Phe Leu Lys Thr Cys Thr Ser 
20 25 30 



Ala Ala Asn His Ala Thr Pro 
35 



<210> 278 

<211> 26 

<212> PRT 

<213> Homo sapien 

<400> 278 

Met Pro Ile Arg Leu Cys Val Cys Ala Arg Phe Leu Lys Thr Ala Asn 
1 5 10 15 



Tyr Ile Val Ser Ser Gln Met Ser Gly Phe 
20 25 



<210> 279 

<211> 149 

<212> PRT 

<213> Homo sapien 

<400> 279 

Met Leu Val Phe Ser Ala Gly Arg Leu Ala Cys Trp Arg Ala Val Cys 
1 5 10 15 



Trp Leu Gly Arg Cys Ala Cys Ala Ser Ser Arg Val Cys Leu Arg Leu 
20 25 30 



Val Leu Ser Trp Ser Arg Val Val Cys Phe Trp Trp Ser Phe Trp Leu 
35 40 45 



Phe Val Ser Val Val Cys Phe Val Phe Ser Cys Phe Val Ser Leu Leu 
50 55 60 



Cys Cys Cys Gly Val Arg Leu Tyr Phe Val Val Ser Trp Gly Val Phe 
65 70 75 80 



Phe Cys Asp Leu Leu Arg Cys Cys Tyr Asp Asn Val Cys Phe Ala His 
85 90 95 



Pro Thr Val Cys Phe Ser Ser Cys Pro Phe Phe Gly Val Leu Asn Tyr 
100 105 110 



Val Phe Phe Ile Leu Phe Pro His Trp Gly Val Cys Val Gly Gly Val 
115 120 125 



Val Pro Phe Ala Ala Val Phe Ser Gly Phe Phe Trp Ser Cys Pro Cys 
130 135 140 



Phe Val Ala Ala Arg 
145 



<210> 280 

<211> 54 

<212> PRT 

<213> Homo 

<400> 280 



Met Ile Leu Lys Gly Thr Leu Thr Ile Tyr Asn Lys Ser Phe Gln Tyr 
1 5 10 15 



Tyr Ser Ser Ser Leu Thr Ser Glu Ser Leu Val Tyr Val Ile Leu Ser 
20 25 30 



Arg Lys Lys Thr Thr Tyr Lys Ser His Phe Pro Thr Lys Leu Ile Gln 
35 40 45 



His Pro Thr Leu Lys Ile 
50 



<210> 281 

<211> 114 

<212> PRT 

<213> Homo sapien 

<400> 281 



Val Ala Gly Ile Thr Gly Ile His His His Thr Gln Leu Phe Phe Cys 
1 5 10 15 



Ile Phe Val Arg Asp Arg Phe Leu His Val Gly Gln Ala Gly Leu Glu 
20 25 30 



Leu Pro Thr Ser Gly Asp Pro Pro Thr Ser Ala Ser Gln Ser Asp Asp 
35 40 45 



Phe Ile Phe Ile Phe Asn Cys Ile Asn Leu His Leu Asp Asn Asp Phe 
50 55 60 



Val Lys Gly Val Cys Cys Val Gln Asn Leu Arg Tyr Trp Leu Arg Val 
65 70 75 80 



Lys Tyr Ile Ile Phe Ile Ile Cys Trp Val Ala Ser Ser Tyr Ala Ala 
85 90 95 



Phe Phe Leu Ser Thr Phe Ile Lys Ser Ser Phe Leu Lys Leu Phe Ile 
100 105 110 



Ile Phe 



<210> 282 

<211> 171 

<212> PRT 

<213> Homo sapien 

<400> 282 

Met Leu Phe Cys Ile Phe Thr Val Tyr Cys Phe Tyr Asn Lys Tyr Lys 
1 5 10 15 
Met Lys Met Phe Met Leu Thr Lys Arg Thr Lys Asn Asn Lys Gln Gln 
20 25 30 



Lys Thr Lys Gly Trp Gly Cys His Thr Cys Gly Pro Lys Ala Gly Phe 
35 40 45 



Pro Gly Gly Gly His Leu Val Leu Ser Arg Pro His Asn Ser Pro Pro 
50 55 60 



Lys Tyr Tyr Arg Glu Thr Thr Gly Arg Thr Thr Gln His Thr Lys Arg 
65 70 75 80 



His Asn Thr Gln Asn His His Thr Ala Thr Pro Ala His Arg Arg Gln 
85 90 95 



Arg Thr Arg Arg Glu Gln Lys Glu Lys Gly Gln Gln Lys Lys Ala Ser 
100 105 110 



Ser Thr Ile Thr Thr Gln Ser His Asp Lys Lys Arg Arg Thr Met Thr 
115 120 125 



Lys Thr Ser Ser Ser Thr Arg His Arg Gln Asp Lys Ser Lys Lys Asp 
130 135 140 



Arg Thr Arg Gln Lys Thr Thr Arg Asp Glu Thr Thr Lys Lys Pro His 
145 150 155 160 



Lys Lys Ala Ser Glu Asn Lys Asn Gln Leu Thr 
165 170 



<210> 283 

<211> 90 

<212> PRT 

<213> Homo sapien 

<400> 283 

Met Asn Ala Thr Val Leu Ser Val Phe Lys Ala Lys Leu Leu Trp Lys 
1 5 10 15 



Leu Gly Gly Gly Pro Pro Cys Gly Pro Pro Ala Ala Leu Cys Leu Pro 
20 25 30 



Leu Gly Ala Pro Glu Leu Met Pro Val Val Ile Ser Ala Met Leu Asp 
35 40 45 
Ala Arg Ser Gln Arg Ser Ala Ser Leu Ser Gln Leu Ala Cys Ala Ala 
50 55 60 



Leu Thr Trp Leu Pro Ala Val Leu Arg Asn Leu His Trp Trp Asp Lys 
65 70 75 80 



Gly Met Lys Arg Ile Asn Lys Asp Leu Lys 
85 90 



<210> 284 

<211> 154 

<212> PRT 

<213> Homo sapien 

<400> 284 

Lys Glu Ala Pro Ser Ser Gln Asp Ile Leu Val Phe Leu Thr Gly Gln 
1 5 10 15 



Glu Glu Ile Glu Ala Met Ser Lys Thr Cys Arg Asp Ile Ala Lys His 
20 25 30 



Leu Pro Asp Gly Cys Pro Ala Met Leu Val Leu Pro Leu Tyr Ala Ser 
35 40 45 



Leu Pro Tyr Ala Gln Gln Leu Arg Val Phe Gln Gly Ala Pro Lys Gly 
50 55 60 
Tyr Arg Lys Val Ile Ile Ser Thr Asn Ile Ala Glu Thr Ser Ile Thr 
65 70 75 80 



Ile Thr Gly Ile Lys Tyr Val Val Asp Thr Gly Met Val Lys Ala Lys 
85 90 95 



Lys Tyr Asn Pro Asp Ser Gly Leu Glu Val Leu Ala Val Gln Arg Val 
100 105 110 



Ser Lys Thr Gln Ala Trp Gln Arg Thr Gly Arg Ala Gly Arg Glu Asp 
115 120 125 



Ser Gly Ile Cys Tyr Arg Leu Tyr Thr Glu Asp Glu Phe Glu Lys Phe 
130 135 140 



Asp Lys Met Thr Val Pro Glu Ile Gln Arg 
145 150 